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(57) Abstract 

Disclosed and claimed are novel Badl- 
lus thuringiensis isolates which have lepi- 
dopteran activity. Thus, these isolates, or 
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insect pests. Further, genes encoding novel 
^<endotoxins can be removed from the iso- 
lates and transferred to other host micxobes, 
or plants. Expression of the ^-endotoxins in 
such hosts results in the control susceptible 
insect pests in the environment of such hosts. 
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DESCRIPTION 



OR-1 ON ORPHAN RECEPTOR BELONGING TO THE 
NUCLEAR RECEPTOR FAMILY 



Cross-Reference to a Related Applicaiion 
This is a continuation-in-part of co-pending application Serial No. 08A)32,77S, filed March 
16, 1993, which is a continuation of application Serial No. 07/597,607, filed October 15, 1990, now 
abandoned. 



Background of the Invention 
The soil microbe Bacillus thuringiensis (At) is a Gram-positive, spore-forming bacterium 
characterized by parasporal crystalline protein inclusions. These inclusions often appear 
microsoopicaUy as distinctively shaped ciysuls. These ciystalline proteins can be profbrms of 6- 

15 endotcmns which are highly toxic to pests and specific in their toxic activity. Certain Rl 

endotoxin genes have been isolated and sequenced, and recombinant DNA-based Bx products 
have been produced and approved. In addition, with the use of genetic engineering techniques, 
new approaches for delivering Rt endotoxins to agricultural enviromnents are under development, 
including the use of plants genetically engineered with endotoxin genes for insect resistance and 

20 the use of stabilized intact microbial cells as B.l endotoxin delivery vehicles (Gaertner, F.H., L. 

Kim [1988] TIBTECH 6:S4-S7). Thus, isolated Ba. endotoxin genes are becoming commercially 
valuable. 

Until the last ten years, commercial use of Bx pesticides has been largely restricted to 
a narrow range of lepidopteran (caterpillar) pests. Preparations of the spores and crystals of A 

25 thuringiensis subsp. kurstaki have been used for many years as commercial insecticides for 

lepidopteran pests. For example, B. thuringiensis var. kurstaki HD-1 produces a delta endotoxin 
which is toxic to the larvae of a number of lepidopteran insects. 

In recent years, however, investigators have discovered Rl pesticides with specificities for 
a much broader range of pests. For example, subspecies of B.l, namely israelensis and son diego 

30 (a.k.a. Rt tenebrionis, aJca. M-7), have been used commercially to control insects of the orders 

Diptera and Coleoptera, respectively (Gaertner, F.H. [1989] "Cellular Delivery Systems for 
Insectiddal Proteins: Living and Non-living Microorganisms,** in Controlled Delivery of Crop 
Protection Agents, R.M. Willdns, ed., Taylor and Francis, New York and London, 1990, pp. 245- 
255). See also Couch, T.L. (1980) ''Mosquito Pathogenicity of Bacillus thuringiensis var. 

35 israelensis,^ Devehpm&us in Industrial Microbiology 22:61-76; Beegle, CC, (1978) ''Use of 

Entomogenous Bacteria in Agroecosystems,** Developments in Industrial Microbiology 20:97-104. 
Krieg, A., A.M. Huger, G.A. Langenbru(±, W. Schnetter (1983) 2L ang. Ens. 96:500-508, describe 
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a Rl isolate named BacUbis thurin^ensis var. tenebrionis, which is reportedly active against two 
beetles in the order Coleoptera. These are the Colorado potato beeUe, Leptinouma decemlineaui, 
and Agelastica aim. 

Recently, new subspecies of Rt have been identified, and genes responsible for active d- 
endotoxin proteins have been isolated (Hofte, R, KR. Whiieley [1989J Micrdbiologicai Reviews 
52(2):242-255). Hofte and Whiteley classified Rt crystal protein genes into 4 major classes. The 
classes were Qyl (Lepidoptera-spedfic), CiyH (Lepidopiera- and Diptera-spedfic), Oym 
(Coleoptera-spedfic), and OrylV (Diptera-specific). The discoveiy of strains specifically toxic to 
other pests has been reponed. (Feitelson. J.S., J. Payne, L. Kim [19921 Bio/Technology 10:271-275). 

The cloning and expression of a B.t crystal protein gene in Escherichia coti has been 
described in the published literature (Schnepf. RE., RR, Whiteley [1981] Proc Natl Acad. ScL 
USA 78:2893-2897). VS. Patent 4,448,885 and US, Patent 4,467,036 both disclose the expression 
of Rt. crystal protein in £ coU. U.S. Patents 4,797,276 and 4,853331 disclose R Omringiensis var. 
san diego (a.k.a. Rt tenebrionis, aJuu M-7) whidi can be used to control coleopteran pests in 
various environments. U.S. Patent No. 5,164,180 discloses a Rl isolate, PS81A2, which is active 
against lepidopteran pests. U.S. Patent No. 5,151363 discloses certain isolates of An which have 
activity against nematodes. Many other patents have issued for new Rl isolates and new uses of 
B.L isolates. The discovery of new Rl isolates and new uses of known Rl isolates remains an 
empirical, unpredictable arL 

Brief Sum mary of the Invention 

The subject invention concerns novel BaciUus thuringiensis isolates which have activity 
against lepidopteran pests. 

Spedficalty, the invention comprises novel Rl isolates and mutants thereof, and novel 
delta endotoxin genes obtainable from these Rl isolates which encode proteins which are active 
against lepidopteran pests. 

Brief Des cription of the Drawing 

Figiure 1 shows the one-letter amino add sequence of the Generic Formula (SEQ ID NO. 
27). Numbering is for convenience and approximate location only. In the Generic Formula, the 
N-tcrminal half of the molecule is comprised of residue nos. 1-638. The C-ierminal half is 
comprised of residues 639-1213. Wherein 

A^ala G«gly Ms met S^ser 

C^Ofs H«his Naasn T«thr 

' D = asp I = ile P » pro V a val 

E = glu K = lys Q = gln W^trp 

F « phe L 8 leu R a arg Y =^tyr 
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k«KorR 

z = G, S, D, or N 

j = E, Q, R, or K 

X « G, S, D, N, E, Q, R» or K 

u = Q P» T, or A 

b = M. I, U V. or F 

o = QP,T,A,M.I,L,V,orF 



- 3= any naturally occurring amino add 

. =B any naturally occurring amino add or complete omission thereol 



Brief Description of the Sequences 
S£Q ID NO. 1 is the nudeotlde sequence of the gene 81A2. 
SEQ ID NO. 2 is the amino add sequence of the toxin 81A2. 
SEQ ID NO. 3 is the nucleotide sequence of the gene 91C2. 
SEQ ID NO. 4 is the amino acid sequence of the toxin 91C2. 

SEQ ID NO. 5 is a radiolabeled oligonucleotide probe used in RFLP analysis as described 
in Example 3. 

SEQ ID NO. 6 is a forward oligonucleotide primer used to amplify gene 91C2 according 
to the subjea invention. 

SEQ ID NO, 7 is a reveise oligonudeotide primer used to amplify gene 91C2 according 
to the subject invention. 

SEQ ID NO. 8 is a synthetic oligonucleotide probe used to identify gene 91C2 according 
to the subject invention. 

SEQ ID NO. 9 is the peptide sequence encoded by probes for CrylF gmes. 

SEQ ID NO. 10 is a nucleotide probe according to the subject invention. 

SEQ ID NO. 11 is the peptide sequence encoded by probes for CryEF genes. 

SEQ ID NO. 12 is a nudeotide probe according to the subjea invention. 

SEQ ID NO. 13 is the peptide sequence encoded by probes for CiylF genes. 

SEQ ID NO. 14 is a nudeotide probe according to the subject invention. 

SEQ ID NO. 15 is the peptide sequence encoded by probes for CiylF genes. 

SEQ ID NO. 16 is a nudeotide probe according to the subject invention. 

SEQ ID NO. 17 is the pq)tide sequence encoded by probes for CiylF genes. 

SEQ ID NO. 18 is a nucleotide probe according to the subject invention. 

SEQ ID NO. 19 is the peptide sequence encoded by probes for CrylF genes. 

SEQ ID NO. 20 is a nucleotide probe according to the subject invention. 

SEQ ID NO. 21 is the peptide sequence encoded by probes for CrylF genes. 
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SEQ ID NO. 22 is a nucleotide probe according to the subjea invention. 
SEQ ID NO. 23 is the peptide sequence encoded by probes for CcylF genes. 
SEQ ID NO. 24 is a nucleotide probe according to the subject invention. 
SEQ ID NO. 25 is the peptide sequence encoded by probes for CrylF genes. 
5 SEQ ID NO. 26 is a nucleotide probe according to the subject invention. 

SEQ ID NO. 27 is the Generic Formula according to tiie subject invention. 



Detailed Disclosure of the Invention 
The subject invention concerns isolates of BacUius thuring^nsis having anti-lepidopteran 

10 activity. These isolates comprise genes which code for <$-endotQxins» which toxins are responsible 

for the observed anti-lepidopt^an activity. Thus, the subject invention concerns anti-lepidopteran 
Rl isolates, anti-lepidopteran JBLt toms» and genes which encode tiiese todns. Rirther 
embodiments of the subject invention concern recombinant hosts transformed witii genes encoding 
the anti-lepidopteran An toxins. The subject invration further concerns methods for controlling 

15 lepidopterans, said methods comprising the use of the isolates, toxins, genes, and recombinant 

hosts of the subjea invention. 

Specifically exemplified herein are the isolates designated JBLt PS81T1, B.L PS53C2, Rl 
PS31F4, B.t PS86V1. B.L PS 8612, Rt PS73E, B.t PS81K, B.t PS83E2, Rt PS81E, Rl PS81Z3, 
Rl PS53B5, B,L PS83R, Rl PS53B2, Rl PS83N2, B.L PS81B5, B.L PS86W1, and B.L PS93C2. 

20 . Also specifically exemplified is the toxin designated 91C2 and the gene which encodes this toxin. 

The 91C2 gene is a CiylF gene. CrylF is a subclass of genes within the lepidopteran-active Cryl 
dass of At genes. The discovery described in the subjea application enables a person skilled in 
the art to identify other CrylF toxins (and genes coding for these toxins) having anti-lepidopteran 
activity. The toxins of the subjea invention are cfaaraaerized as being active against lepidopterans 

25 and having one or more of the foUowing diaraaeristics: 

1. A high degree of amino add homology with toxin 91C2. 

2. A nudeotide sequence encoding the toxin wherein the nudeotide sequence 
hybridizes with probes or genes disclosed herein. 

3. A nucleotide sequence encoding the toxin wherein the nudeotide sequence can 
30 be amplified by PGR using primers disdosed herein. 

4. An amino add sequence which conforms to the Generic Formula presented 
herein. 

5. Immunoreactivity to an antibody raised to toxin 91C2. 



35 
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Bacillus ihuringiensis isolates useful aoconiing to the subjea invention have the foUowing 
characteristics in their biologically pure form: 



Table 1. Taxonomic characterization of the B,l isolates of the subjea invention 

Approx. Toxin 
Isolate Crystal Type MW (kD) 









Serot]^pe 


Activity 




DipyTanua 




aizawai 


Lepidoptera 




DipyTalalQ 


1 'in iCA 
13U, dU 


Icurstaki 


Lepidoptera 




DipjTanua 


13U, 60 


kurstaki 


Lepidoptera 


PS86V1 


bipyramid 


130 


galleriae 


Lepidoptera 


PS86I2 


bipyramid 


130 


morrisoni 


Lepidoptera 


PS73E 


bipyramid 


130 


aizawai 


Lepidoptera 


PS81K 


bipyramid 


130 


aizawai 


Lepidoptera 


PS83E2 


amorphic 


130 


aizawai 


Lepidoptera 


PS81E 


bipyramid 


130 


aizawai 


Lepidoptera 


PS81Z3 


bif^mid 


130 


aizawai 


Lepidoptera 


PS53B5 


bipyramid 


130,60 


kenyae 


Lepidoptera 


PS83R 


bii^ramid 


130 


aizawai 


Lepidoptera 


PS53B2 


bipyramid 


130,60 


kei^e 


Lepidoptera 


PS83N2 


bipyramid 


130.60 


sotto/kenyae 


Lepidoptera 


PS81B5 


amorphic 


130 


aizawai 


Xepldoptera 


PS86W1 


bipyramid 


130 


galleriae - 


Lepidoptera 


PS91C2 


bipyramid 


130 


morrisoni 


Lepidoptera 



Bx isolates useful according to the subject Invention have been deposited. Also deposited 
are recombinant microbes comprising the B.l genes of interest The cultures have been deposited 
in the permanent coUection of the Patent Culture CoUecUon (NRRL), Regional Research Center. 
1815 North University Street, Peoria, Illinois 61604 USA. 



Culture Accession Number Deposit Date 

BacUhis thurin^ensis PS81IA NRRL B-18484 April 19, 1989 

Bacillus ihuringiensis PS91C2 NRRL B-18931 December 27, 1991 

£1 coU NM522 0)MYC2361) NRRL B-21016N December 17. 1992 

The subject cultures have been deposited under conditions that assure that access to the 
cultures will be available during the pendency of this patent application to one determined by the 
Commissioner of Patents and Trademarks to be entitled thereto under 37 CFR 1.14 and 35 U.S.C 
§122. The deposits are available as required by foreign patent laws In countries wheicin 
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counterparts of the subject application, or its pr geny, are filed. However, it should be 
understood that the availability of a deposit does not constitute a license to practice the subject 
invention in derogation of patent rights granted by governmental action. 

Further, the subjea culture deposits will be stored and made available to the public in 
accord with the provisions of the Budapest Treaty for the Deposit of Microorganisms, Le., they 
will be stored with all the care necessaiy to keep them viable and uncontaminated for a period 
of at least five years after the most recent request for the furnishing of a sample of the deposit, 
and in any case, for a period of at least 30 (thirty) years alter the date of deposit or for the 
enforceable life of any patent which may issue disclosing the cultures. The depositor 
acknowledges the duty to replace the deposits should the depository be unable to furnish a sample 
when requested, due to the condition of the deposit(s). All restrictions on the availability to the 
public of the subject culture deposits will be irrevocably removed upon the granting of a patent 
disclosing them. 

Toxins and genes . The toxins and genes according to the subjea invention include not 
only the full length sequences disclosed herein but also fragments of these sequences, longer 
sequences, and fusion proteins, which retain the charaaeristic pesticidal activity of the toxins 
specifically exemplified herein. 

One aspect of the subject invention concerns the disooveiy of a generic chemical formula 
hereinafter referred to as the Generic Formula (SEQ ID NO. 27), whidi can be used to identii^ 
toxins having aaivi^ against lepidopterans. The Generic Formula describes toxin proteins having 
molecular weights of about 130 kDa. 

The Generic Formula is shown in Figure 1 designated by a one-letter amino add 
sequence. The Sequence Listing provided herein according to the Patentin format utilizes the 
three-letter amino acid code and has no provision for showing a choice between two amino adds 
at a given position. Therefore, within the Patentin Sequence Listing, -Xaa" is used to denote 
points of variation within a sequence, but the single letter code of Figure 1 should be referred to 
for the spedfic^amino add substitutions which are acceptable at a given location in the sequence. 

Further guidance for diaracterizing the lepidopteran toxins of the subjea invention is 
provided in Tables 2 and 3, which demonstrate the relatedness among tnins within the known 
Ciyl subclasses of lepidopteran toxins. These tables show a numeric score for the best matching 
alignment between two proteins that reflects: (1) positive scores for exact matches, (I) positive 
or negative scores reflecting the likelihood (or not) of one amino add substituting for another in 
a related protein, and (3) negative scores for the introduction of gaps. A protein sequence aligned 
to itself will have the highest possible score. Le., all exact matches and no gaps. However, an 
unrelated protein or a randomly generated sequence wiU typically have a low positive score. 
Related sequences have scores between the random background score and the perfea match score. 
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The sequence comparisons reported herein were made using the algorithm of Smith and 
Waterman ([1981] Advances in Applied Mathematics 2:482^), implemented as the program 
-Bestfir in the GCG Sequence Analysis Software Package Version 7, April 1991. The sequences 
were compared with default parameter values (comparison: Swagappep.Cmp, Gap: 3.0. length 
weight: 0.1). The program output value is referred to as the Quality score. 

Tables 2 and 3 show the pairwise alignment scores between the indicated amino acids of 
the Ciyl toxin proteins. Table 4 shows the amino adds compared from the proteins of interest. 

Table 2 shows the scores prior to adjustment for random sequence scores. Note that for 
each subclass, the highest alignment score is alwsQfs with another toxin protein from the same 
subclass. For example, the highest aUgnment score with CiyIA(a), aside from itself, is with 
CryIA(d). Funhennore. CryIA(a) scores highest with aU three other CiylA toxin proteins. In a 
similar manner, other Cryl toxins score highest with other members of the same subclass. Of 
particular relevance to the subjea invention is the fea that the CrylF toxin proteins score highest 
with each other. 

Table 3 shows the same analysis after subtraction of the average score of 50 alignmente 
of random shuffles of the column sequences with the row sequences. Note that in Table 3 the 
same relationships hold as in Table 2. Lc, toxin proteins score highest with other members of the 
same subclass. Again, the two CiylF toxin proteins score highest with each other. Examination 
of the adjusted alignment scores for members of the same subclass reveals that Qyl subclasses can 
be defined as those proteins with adjusted alignment scores of about 450 or greater. 

Thus, certain toxins of the subjea invention can be defined as those whi<di have 
lepidopieran activity and have an alignment value of 450-500 or greater with CiyIF(a) or CryIF(b) 
(91C:2). As.used herein, the term -alignment value" refers to the adjusted scores obtained above 
and used to create the scores reported in Table 3. 
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Table 4. 



Protein 


Amino adds compared 


CiyIA(a) 


1-607 


CryIA(b) 


1-608 


ciyiA(cj 


1-609 


CryIA(d) 


1-607 


OylB 


1-636 


QylC 


1-617 


CiylD 


1-592 


CryIE(a) 


1-601 


CryIE(b) 


1-599 


CryIF(a) 


1-601 


CiylFfb) f91C2) 


1-600 



15 Toxins of the subject invention are specifically exemplified herein by the toxin encoded 

by the gene designated 91CZ Since this toxin is merely exemplary of the toxins of the subjea 
invention, it should be readily apparent that the subject invention further comprises variant tming 
(and nucleotide sequences coding for variant toxins) having the same, or essentially the same, 
biological activity against lepidopterans of 91C2. These equivalent toxins will have amino add 

20 homology with 91C2. This amino add homology will typically be greater than 75%, preferably 

be greater than 90%, and most preferably be greater than 95%. The amino add homology will 
be highest in certain critical regions of the toxin which account for biological activity or are 
involved in the determination of three-dimensional configuration wiiich ultimately is responsible 
for the biological activity. In this regard, ceruin amino add substitutions are acceptable and can 

25 be readily made in regions whidi are not critical to activity or are conservative amino add 

substitutions which do not affect the three-dimensional configuration of the "K^lecule. For 
example, amino adds may be placed in the following classes: non-polar, undiarged polar, basic, 
and addle. Conservative substitutions whereby an amino add of one class is replaced with another 
amino add of the same type £all within the scope of the subject invention so long as the 

30 substitution does not materially alter the biological activity of the compound. Table 5 provides 

a listing of examples of amino adds belonging to each class. 
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Table 5 

Class of Amino Add Examples of Amino Adds 

Nonpolar Ala, Val, Leu, Be, Pro, Met, Phe, Ttp 
Uncharged Polar Gly. Ser, Tta. Cyi,Tyt, Asn, Gin 
Acidic Asp. Glu 
Lys,AiB,His 

In some instances, non-conservative substitutions can also be made. The critical feaor 
is that these substitutions must not significantly detraa from the biological activiqr of the toxin. 

Tlie toxins of the subject invention can also be characterized in terms of die shape and 
location of toxin inclusions, whidi are described above. 

It should be apparent to a person skiUed in tiiis an tiiat genes encoding lepidopteran- 
active toxins can be identified and obtained through several means. Tlie specific genes exemplified 
herein may be obtained from Uie isolates deposited at a culture depositoiy as described above. 
These genes, or portions or variants Uiereof, may also be constructed synUieticaUy. for example, 
by use of a gene machine. As used herein, the terms "varianu" or "variations" of genes refer to 
nucleotide sequences which code for Uie same toxins or which code for equivalent toxins having 
lepidopteran activity. Variations of these genes may be readily consirurted using standard 
techniques for making point mutations. Also, fragments of tiiese genes can be made using 
commercially available exonucleases or endonucleases according to standard procedures. For 
example, oizymes such as Bal31 or site-dfrected mutagenesis can be used to systematically cut off 
nucleotides from tiie ends of tiiese genes. Also, genes which code for active fragments may be 
obtained using a variety of oOiw restricticm en^es. Proteases may be used to directly obtain 
active fragments of these toxins. 

Equivalent toxins and/or genes encoding tiiese equivalent toxins can also be located from 
B.t isolates and/or DNA Ubraries using tiie teachings provided herein. There are a number of 
methods for obtaining ttie pestiddal toxins of tiie instant invention. For example, antibodies to 
tiie pestiddal toxins disclosed and daimed herein can be used to identify and isolate oUier toxins 
from a mixture of proteins. Specifically, antibodies may be raised to tiie portions of tiie toxins 
which are most constant and most distinct from otiier At toxins. These antibodies can tiien be 
used to spedfically identify equivalent toxins with the characteristic activity by 
immunopredpitation, enzyme linked immunosorbent assay (EUSA), or Western blotting. 
Antibodies to tiie toxins disdosed herein, or to equivalent toxins, or fragments of tiiese toxins, can 
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readily be prepared using standard procedures in this art The genes coding for these toxins can 
then be obtained from the micioorgenisnL 

A further method fbr identifying the toxins and genes of the subject invration is through 
the use of oligonucleotide probes. These probes are detectable nucleotide sequences. These 

5 sequences may be detectable by virtue of an appropriate label or may be made inherently 

fluorescent as described in International Patent Application No. WO93/16094. As is well known 
in the art, if the probe molecule and nucleic add sample hybridize by forming a strong bond 
between the two molecules^ it can be reasonably assumed that the probe and sample have 
substantial homology. Detection of the probe provides a means for determining in a known 

10 manner whether faybridi2ation has occurred. Such a probe analysis provides a rapid method for 

identifying toxin-encoding genes of the subject invention. 

The nucleotide segments ^ch are used as probes according to the invention can be 
synthesized by use of DNA synthesizers using standard procedures. In the use of labeled 
nucleotide segments as probes, the particular probe is labeled with any suitable label known to 

15 those skilled in the art, including radioactive and non-radioaaive labels. Typical radioaaive labels 

include ^P, ^^I, ^S, or the like. A probe labeled with a radioactive isotope can be constructed 
from a nucleotide sequence complementary to the DNA sample by a conventional nick translation 
reaction, using a DNase and DNA polymerase. The probe and sample can then be combined in 
a hybridization buffer solution and held at an appropriate temperature until annealing occurs. 

20 Preferably, hybridization is conducted under stringent conditions by techniques well known in the 

art, as described, for example, in Keller, G.H., M.M Manak (1989) DNA Probes, Stockton Press, 
New York, NY, pp. 169-170. Thereafter, the membrane is washed firee of extraneous mat^ials, 
leaving the sample and bound probe molecules typically detected and quantified \sy 
autoradiography and/or liquid scintillation counting. 

25 Non-radioactive labels include, for example, ligands such as biotin or thyroxine, as well 

as enr^es such as hydrolases or perixodases, or the various chemiluminescers such as ludferin, 
or fluorescent compounds like fluorescein and its derivatives. The probe may also be labeled at 
both ends with different types of labels for ease of separation, as, for example, by using an isotopic 
label at the end mentioned above and a biotin label at the other end. 

30 Duplex formation and stability depend on substantial complementarity between the two 

strands of a hybrid; a certain degree of mismatch can t>e tolerated. Therefore, the probes of the 
subjea invention indude mutations (both single and multiple), deletions, insertions of the 
described sequences, and combinations thereof wherein said mutations, insertions and deletions 
permit formation of stable l^brids with the target polynucleotide of interest Mutations, 

35 insertions, and deletions can be produced in a given polynucleotide sequence in many ways, and 

these methods are known to an ordinarily skilled artisan. Other methods may become known in 
the future. 



wo 96/05314 



PCT/US9S/10310 



13 

The known methods include^ but are not limited to: 

(1) synthesizing cfaemicaUy or otherwise an artificial sequence which is a mutation, 
insertion or deletion of the known sequence; 

(2) using a probe of the present invention to obtain via hybridization a new sequence 
5 or a mutation, insertion or deletion of the probe sequence; and 

(3) mutating, inserting or deleting a test sequence in vitro or in vivo. 

It is important to note that the mutational, insertional, and deletional variants generated 
from a given probe may be more or less efficient than the original probe* Notwithstanding such 
differences in efficiency, these variants are within the scope of the present invention. 

10 Thus, mutational, insertional, and deletional variants of the disclosed sequences can t>e 

readily prepared by methods which are well known to those skilled in the art These variants can 
be used in the same manner as the instant probes so long as the variants have substantial 
sequence homology with the probes. As used herein, substantial sequence homology refers to 
homology which is sufficient to enable the variant to function in the same capacity as the original 

15 probe. Preferably, this homology is greater than 50%; more preferably, this homology is greater 

than 75%; and most preferably, this homology is greater than 90%. The degree of homology 
needed for the variant to funaion in its intended capacity will depend upon the intended use of 
the sequence. It is well within the skill of a person trained in this art to make mutational, 
insertional, and deletional mutations which are designed to improve the function of the sequence 

20 or otherwise provide a methodological advantage. 

Specific nucleotide probes useful according to the subjea invention in the rapid 
identification of CrylF class toxin genes intlude: 

(i) DNA coding for a peptide sequence "Ser Thr Gly Arg Leu Pro Leu Asp** (SEQ 
ID NO. 9). A specific example of such a probe is ''ACTACWGGMA 

25 GRTTACCRTT RGAY** (SEQ ID NO. 10); 

(ii) DNA coding for a peptide sequence "Glu Asp Ser Pro Val Ser Ala Asn" (SEQ 
ID NO. 11). A specific example of such a probe is ''GARGATTCWC 
CAGTWTCWGC WAAT** (SEQ ID NO, 12); 

(iii) DNA coding for a peptide sequence "Asn Gly Phe Asn Arg Ala Glu Phe Gly 
30 Val** (SEQ ED NO. 13). A specific example of such a probe is - AATGG WTTTA 

ATAGTGCTGA ATTTOGGAGT W (SEQ ID NO. 14); 

(iv) DNA coding for a peptide sequence "Val Thr Ala Glu Thr Val Arg Ser Gin Thr** 
(SEQ ID NO. 15). A specific example of such a probe is ''GTAACWGCAG 
ARACWGTWAG WAGTCAAACW** (SEQ ID NO. 16); 

35 (v) DNA coding for a peptide sequence "Val Phe Asn Pro Gly Gly Ala lie Trp lie 

Ala Asp Glu** (SEQ ID NO. 17). A specific example of such a prot>e is 
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"GTMITYAATC CWGOWGGMGC MATWTGGATW GCWOATOARG AT" 
(SEQ ID NO. 18); 

(vi) DNA coding for a peptide sequence "Val Arg Gty Gly Phe GljT (SEQ ID NO. 

19). A specific example of su(± a probe is ''GTMMGAGGWGGWTTTGGR** 
(SEQ ID NO. 20); 

(vii) DNA coding for a peptide sequence **Gly Thr Asn His Thr Arg Thr- (SEQ ID 
NO. 21). A specific example of such a probe is '*GGWACRAAYC 
AYACMMGAAC W" (SEQ ID NO. 22); 

(viii) DNA coding for a peptide sequence **Val Arg Trp Pro Gly Glu De- (SEQ ID 
NO. 23). A specific example of such a probe is "GTWMGATGGC 
CWGGWGARAT W (SEQ ID NO. 24); 

(ix) DNA coding for a peptide sequence -Ser Asp Ser Trp Arg Ala" (SEQ ID NO. 
25). A specific example of such a probe is " AGTGATTCWT GGAGAGCW" 
(SEQ ID NO. 26). 

Because of the redundancy of the genetic code, i.e., more than one coding nucleotide 
triplet (codon) can be used for most of the amino acids used to make proteins, different 
nucleotide sequences can code for a parucular amino acid. Thus, the amino add sequences of the 
B.U toxins and peptides can be prepared by equivalent nucleotide sequences encoding the same 
amino add sequence of the protein or peptide. Accordingly, the subject invention includes such 
equivalent nudeotide sequences. Also, inverse or complement sequmces are an aspea of the 
subject invention and can be readily used by a person skilled in this art 

^ecombiiiant lypsts. The toxin-encoding genes harbored by the isolates of the subjea 
invention can be introduced into a wide variety of microbial or plant hosts. Expiession of the 
toxin gene results, direaly or indirectly, in the intracellular production and mainti^nanr^ of the 
pesticide. With suitable microbial hosts, e.g., Pseudomonas, live microbes can be appUed to the 
situs of lepidopterans where they will proliferate and be ingested by the pesL The result is a 
control of this pest Alternatively, the microbe hosting the toxin gene can be treated under 
conditions that prolong tiie activity of the toxin and stabilize the ceU. The treated ceU, which 
retains the toxic activiqr, then can be applied to the environment of the target pest. 

Where the At toxin gene is introduced via a suitable vector into a microbial host, and 
said host is appUed to the environment in a Uving state, it is essential that certain host microbes 
be used- For example, microorganism hosts can be selected which are known to occupy tiie sofl. 
These microorganisms are seleaed so as to be capable of successfuUy competing in tiie soU witii 
tiie wild-type microorganisms. It is also important that tiiqr provide for stable maint^pym^ and 
expression of Uie gene expressing the polypeptide pestidde, and. desirably, provide for improved 
protection of the pestidde from environmental degradation and inactivation. 
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A large number of microorganisms are known to inbabit tbe rbizosphere (the soil 
surrounding plant roots). These microorganisms include bacteria, algae» and fungL Of particular 
interest are microorganisms, sucb as baaeria, e.g., genera Baeiibis, Pseudmianas, Erwbua, Senatia, 
Klebsiella, Xanthomonas, Streptomyees, Rhizobium, Rhodopseudomonas, Methylophilius, 
5 Agrobaaerium, Acetobacter, Lactobacillus, Arthrobacter, Azotobacter, Leuconostoc, Alcaligmes and 

Clostridium; fimgi, particularly yeast, e.g., genera Saccharomyces, Oyptococcus, Ktuyveromyces, 
SpoTobolomyces, Rhodotorula, and Aureobasidium; microalgae, e.g., £amilies Cyanophyceae, 
Prochlorophyceae, Rhodophyceae, I^phyceae, Chrysophyceae, Prymnesiophyceae, Xanxhophyceae, 
Raphidophyceae, Bacillariophyceae, Eustigmatophyceae, Cryptophyceae, Euglenophyceae, 

10 Prasmophyceae^ and ChUmphyceae. Of particular interest are such phytosphere baaerial species 

as Pseudomanas syringae^ Pseudamonas ftuorescens, Serratia marcescens^ Acetobacter xyHnum^ 
Agrobacterium tumefaciens^ Rhodopseudomonas sj^eroides, Xanthommas campestris^ Rhizobium 
melioti, AlcaUgenes entrophus^ and Azotobacter vinlandU; and pl^osphere yeast species sudi as 
Rhodotorula rubra^ R gfutinis, R marina^ R aurannaca, Cryptococcus albidus, C diffluens, C 

15 laurentii, Saccharomyces rosei, 5. pretoriensis, S. cerevisiae, Sporobolomyces roseus, S. odonts, 

Khiyyeromyces veronae, and Aureobasidium polbilans. Of particular interest are the pigmented 
microorganisms. 

A wide variety of ways are available for introducing a JBLt gene encoding a toxin into a 
microorganism host under conditions which allow for stable maintenance and expression of the 

20 gene. These methods are well known to those skilled in the art and are described, for example, 

in United States Patent No. 5,135^7, which is incorporated herein by reference. 

T^tment of cells. As mentioned above, Rl or recombinant cells expressing a B.L toxin 
can be ueated to prolong the toxin activity and stabilize the celL The pesticide microcapsule that 
is formed comprises the B,L toxin within a ceOular struaure that has been stabilized and will 

25 protect the toxin when the microcapsule is applied to the environment of the target pesL Suitable 

host cells may include either prokaryotes or eukaiyotes, normally being limited to those cells 
which do not produce substances toxic to higher organisms, such as mammnu However, organisms 
which produce substances toxic to higher organisms could be used, where the toxic substances are 
unstable or the level of application sufficiently low as to avoid any possibility of toxicity to a 

30 m a mm a lian host As hosts, of particular interest will be the prokaiyotes and the lower eukaiyotes, 

such as fungi 

The cell will usually be intact and be substantially in the proliferative form when treated, 
rather than in a spore form, although in some instances spores may be employed. 

Treatment of the microbial cell, e.g., a microbe containing the toxin gene, can be by 
35 chemical or physical means, or by a combination of chemical and/or physical means, so long as 

the technique does not deleteriously affect the properties of the toxin, nor diminish the cellular 
capabili^ of protecting the toxin. Examples of chemical reagents are halogenating agoits. 
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particularly halogens of atomic no. 17-80. More particulars^ iodine can be used under mild 
conditions and for sufficient time to addeve the desired results. Other suitable tedmiques 
include treatment with aldehydes* sudi as formaldehyde and glutaraldehyde; anti-infectives, such 
as zephiran chloride and oetylpyridinium dhloride; alcohols, such as isopropyl and ethanol; various 
histologic fixatives, such as Lugol iodine, Bouin*s fixative, and Hell/s fixative (See: Humason, 
Gretchen L., Animal Tissue Techniques^ W.H. Freeman and Company, 1967); or a combination 
of physical (heat) and chemical agents that preserve and prolong the activity of the toxin produced 
in the cell when the cell is administered to the host's environmenL In one preferred embodiment, 
acids can be used to stabilize the cells. Examples of physical means are short wavelength radiation 
such as gamma-radiation and X-radiation, freezing, UV irradiation, lyophilization, and the like. 
Methods for treatment of microbial cells are disclosed in United States Patent Nos. 4,695,455 and 
4,695,462, which are incorporated herein by reference. 

The cells generally will have enhanced structural stabili^ which will enhance resistance 
to environmental conditions. Where the pesticide is in a proform, the method of cell treatment 
should be selected so as not to inhibit processing of the proform to the mature form of the 
pesticide by the target pest pathogen. For example, formaldehyde will crosslink proteins and 
could inhibit processing of the proform of a polypeptide pesticide. The method of treatment 
should retain a substantial portion of the bio-availability or bioaaivity of the toxin. 

Characteristics of particular interest in selecting a host cell for purposes of production 
include ease of introducing the B.l gene into the host, availabiliqr of expression systems, efficiency 
of expression, stability of the pesticide in the host, and the presence of auxiliaiy genetic 
capabilities. Characteristics of interest for use as a pesticide microcapsule include protective 
qualities for the pesticide, such as thick cell walls, pigmentation, and intracellular packaging or 
formation of inclusion bodies; survival in aqueous environments; lack of mammalian toxicity; 
attraaiveness to pests for ingestion; ease of killing and fixing without damage to the toxin; and 
the like. Other considerations include ease of formulation and handling, economics, storage 
stability, and the like. 

Growth of cells. The ceUular host containing the Rt insectiddal gene may be grown in 
any convenient nutrient medium, where the DNA construa provides a selective advantage, 
providing for a selective medium so that substantially all or all of the cells retain the Rt gene. 
These cells may then be harvested in accordance with conventional ways. Alternatively, the cells 
can be ueated prior to harvesting. 

The Bx cells of the invention can be cultured using standard art media and fermentation 
techniques. Upon completion of the fermentation cycle the bacteria can be harvested by first 
separating the B.L spores and crystals from the fermentation broth by means well known in the 
art The recovered Rl spores and crystals can be formulated into a wettable powder, liquid 
concentrate, granules or other formulations by the addition of surfactants, dispersants, inert 
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carriers, and ther components to facmtate handling and appHcation for paiticular target 
These formulations and application procedures are all well known in the art. 

Formulations, Formulated bait granules containing an aitractant and spores and ciystals 
of the B.L isolates, or recombinant microbes comprising the gene(s) obtainable from the Rl 
isolates disclosed herein, can be applied to the soil. Formulated product can also be applied as 
a seed-ooating or root treatment or total plant treatment at later stages of the crop cycle. 

As would be appreciated by a person skiUed in the art. the pestiddal concentration wiU 
vary wtoely depending upon the nature of the particular formulation, particularly whether it is a 
concentrate or to be used directiy. The pesticide wiU be present in at least 1% by weight and may 
be 100% by weighL The dry formulations will have from about 1-95% by weight of the pesticide 
while the liquid formulations will generally be from about 1-60% by weight of the solids in the 
liquid phase. The formulations will generaUy have from about 10^ to about 10^ cells/mg. These 
formulations will be administered at about 50 mg (liquid or dry) to 1 kg or more per hectare. 

The formulations can be applied to the environment of the lepidopteian, e.g., soU. by ' 
spraying, dusting, sprinkling, or the like. 

Mutants. Mutants of the novel isolates of the invention can be made by procedures weU 
known in the an. For example, an asporogenous mutant can be obtained tiuough ethylmethane 
sulfonate (EMS) mutagenesis of a novel isolate. The mutants can be made using ultraviolet Ught 
and nitrosoguanidine by procedures well known in the art 

A smaller percentage of the asporogenous mutants vnll remain intaa and not tyse for 
extended fermentation periods; tiiese strains are designated lysis minus (-). Lysis minus strains can 
be identified by screening asporogenous mutants in shake flask media and selecting those mutants 
that are still intact and contain toxin ciystals at the end of the fermentation. Lysis minus strains 
are suitable for a cell fixation process that will yield a protected, encapsulated toxin protein. 

To prepare a phage resistant variant of said asporogenous mutant, an aliquot of the phage 
lysate is spread onto nutrient agar and allowed to diy. An aUquot of the phage sensitive bacterial 
strain is tiien plated directiy over the dried lysate and aDowed to dry. The plates are incubated 
at 3a»C The plates are incubated for 2 days and, at Uiat time, numerous colonies could be seen 
growing on tiie agar. Some of these colonies are picked and subcultuied onto nutrient agar plates. 
These apparent resistant cultures are tested for resistance by cross streaking witii tiie phage lysate. 
A line of tiie phage lysate is streaked on Uie plate and aUowed to diy. Tlie presumptive resistant 
cultures are Uien streaked across tiie phage line. Resistant bacterial cultures show no lysis 
anywhere in tiie streak across tiie phage line after overnight incubation at 30ra The resistance 
to phage is Uien reconfirmed by plating a lawn of tiie resistant culture onto a nutrient agar plate. 
The sensitive strain is also plated in the same manner to serve as the positive controL* After 
drying, a drop of the phage lysate is plated in Uie center of tiie plate and allowed to diy. 
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Resistant cultures showed no lysis in the area where the phage lysate has been placed after 
incubation at 30^ for 24 hours. 

Following are examples which illustrate procedures, including the best mode, for 
5 practicing the invention. These examples should not be construed as limiting. All percentages 

are by weight and all solvent mixture proportions are by volume unless otherwise noted. 

Example 1 — Culturine of the Bx Isolates of the Invention 

A subculture of a novel Rt isolate, or mutants thereof can be used to inoculate the 
10 following medium, a peptone, glucose, salts medium. 



Bacto Peptone 7*5 gfi 

Glucose 1.0 g/i 

KH2PO4 3-4 gA 

K2HPO4 435 g/1 

15 Salt Solution 5.0 mVl 

CaCl2 Solution 5.0 ml/L 

Salts Solution (100 ml) 

MgSO^-TH^O 2.46 g 

20 MnS04 H20 0.04 g 

ZnS04-7H20 0.28 g 

FeS04-7H20 0.40 g 

CaCl2 Solution (100 ml) 

25 CaCl2-2H20 3.66 g 

pH7,2 



The salts solution and CaQ2 solution are filter-sterilized and added to the autoclaved and 
cooked broth at the time of inoculation. Flasks are incubated at 30^C on a rotary shaker at 200 
30 rpm for 64 hr. 

The above procedure can be readily scaled up to large fennentors by procedures well 
known in the arL 

The S.L spores and/or crystals, obtained in the above fermentation, can be isolated by 
procedures well known in the aru A frequently*used procedure is to subjea the harvested 
35 fermentation broth to separation tediniques, e.g., oratrifugation. 
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Example 2 — Aaivitv of At Isolates Against LepidopteraTis 

The following strains have been tested for anti-lepidopteran activity with the following 

results: 



5 Table 6. Bioassay results 

% Mortality 



Strain 






PS81T1 








inn inn 




PS31F4 




100, 100 


PS86V1 


100 




PS86I2 


100,92 




PS73E 


100,100 




PS81K 


100, 100 




PS83E2 


100, 100 




PS81E 


100,92 




PS81Z3 


100 




PS53B5 




100 


PS83R 


100 




PSS3B2 




100 


PS83N2 


100 




PS81B5 


100.100 




PS86W1 


100 




PS91C2 


100. 100 




PS81A2 


100, 100 





Spodomera exieua bioassav procedure. Rt cultures were harvested and resuspended in 
sterile deionized water. Fned volumes of each culture were incorporated into USDA Insect Diet 
30 (Technical Bulletin 152S, U.S. Department of Agriculture. 1976). TwenQr-four neonate 51 exigua 

were exposed to the diet for 6 days. Mortally readings were taken at this time. 

Trichoplusia ni bioassav procedure. B.L cultures were harvested and resuspended in sterile 
deionized water. Fixed volumes of each culture were top loaded onto USDA Insect Diet Ttays 
were infested with neonate 7. nu After 6 days mortality was determined. 

35 

Example 3 - Characterization of Toxin Genes bv RFLP Analysis 

Total cellular DNA was prepared from Bacillus thwingiensis (Rt) cells grown to an 
optical density, at 600 nm, of 1.0. The cells were recovered by centrifugation, and protoplasts 
were prepared in TES buffer (30 mM TOs-HQ, 10 mM EDTA, 50 mM NaCl, pH = 8.0) 
40 containing 20% sucrose and 50 mgAnl lysoi^e. The protoplasts were lysed by addition of SDS 

to a final concentration of 4%. The cellular material was precipitated overnight at 4"^ in 100 mM 
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(final ooncemration) neutral potassium chloride. The supemate was extracted twice with 
phenol/chlOTOfbnn (1:1). Hie DNA was precipitated with ethanol and purified by isopycnic 
banding on a oesfum chloride-ethidium bromide gradienL 

Total cellular DNA isolated £rom &l cells was digested with a restriction endonuclease 
5 and separated by electrophoresis on a 0.8% (w/v) agarose-TAE (50 mM THs-HQ, 20 mM NaOAc, 

2.5 mM EDTA. pH = 8.0) buffered gel. A Southern blot of the gel was hybridized with the p^PJ- 
radiolabeled oligonucleotide probe, ATGATTCATGCGOCAGATA (SEQ ID NO. 5), and then 
washed to remove unbound radioactivity. The blot was exposed to KODAK X-OMAT™ film 
using standard autoradiography techniques. The results are an array of hybridizing bands 
10 (fingerprint) which correspond to toxin genes or toxin gene fragments. This type of 

characterization is known as Restriction Fragment Length Polymorphism (RFLP) analysis which 
classifies each isolate by a distinct DNA fingerprint. 



Table 7. DNA fingerprints for B.t isolates of the subjea invention 



15 


Isolate 


Hybridizing HinAHl Fragments (Kb) 




PS81T1 


1.13, 3,0, 9.4 




PSS3C2 


1.0S2. 5.8, 6.6 




PS31F4 


5.5. &0 




PS86V1 


5.5, 6.0, 6.6 


20 


PS86I2 


5.0, 6.6, 7.5, 12 




PS73E 


1.052, 1.13, 3.0, 8.5 




PS81K 


3JZ, 7.5, 9.4, 13 




PS83E2 


3.2, 8.5, 12 




PS81E 


1.13, 3.2, 9.4 


25 


PS81Z3 


1.13, 3.0, 8J 




PSS3B5 


1.13, 3.0, 7.5 




PS83R 


1.13, 3.0, 8.5, 12 




PS53B2 


1.052, 1.13, 3A 7.5 




PS83N2 


5.5 


30 


PS81B5 


8.0, 13 




PS86W1 


5.5, 6.6 




PS91C2 


1.13, 3.0, 6.0, 7.5, 8.5 




PS81A2 


13, 16 



35 
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Table 8. 


Hybridizing HmdlU fragments otRt isolates 




of the subject invention 


bolate 


Novel Hybridizing HindlU Fragments (<-Kb) 




3.0, oJO, 1j5 


PS83E2 


3J2 


PS86I2 


SJO 


PS31F4 


5.5, 8.0 


PS53C2 


5.8 


PS81T1 


9.4 



Example 4 - Molecular Qoning and Eroression of a No vel QvIF Toirin Gene from BacUlus 
thurin^ensis Strain PS91C2 

Total cellular DNA was prepared from Bacillus thuringiensis (Rl) cells grown to an 

IS optical density, at 600 nm, of 1.0. Cells were pelleted by centrifugation and resuspended in 

protoplast bufifer (20 mg/ml lysozyme in 0.3 M sucrose, 25 mM Tris-Q [pH 8.0], 25 mM EDTA). 
After incubation at ST^C for 1 hour, protoplasts were lysed by two cycles of freezing and thawing. 
Nine volumes of a solution of 0.1 M NaCl, 0.1% SDS, 0.1 M Tris-Q were added to complete lysis. 
The cleared lysate was extracted twice with phenolxhloroform (1:1). Nucleic acids were 

20 precipitated with two volumes of ethanol and pelleted by centrifugation. The pellet was 

resuspended in TE buffer and RNase was added to a final concentration of 50 /ig/ml After 
incubation at 37**C for 1 hour, the solution was extracted once each with phenolxhloroform (1:1) 
and TE-saturated c^oroform. DNA was precipitated from the aqueous phase by the addition of 
one-tenth volume of 3 M NaOAc and two volumes of ethanoL DNA was pelleted by 

25 centrifugation, washed with 70% ethanol, dried, and resuspended in TE buffer. 

A 1.58 kbp fragment of the novel 130 kDa toxin gene was obtained by polymerase rhain 
reaction (PGR) amplification from PS91C2 cellular DNA using the following primers: forward 
5'-GAGTGGGAAG CAGATCTTAA TAATGCACAA TFAAGGO' (SEQ ID NO. 6) and 
reverse 5'-ATAC(C or T)CGATCGATATGATA(G or A)TCOGT-3' (SEQ ID NO. 7). This 

30 DNA fragment was cloned into pBluescript SyK (Stratagene, La JoUa, CA) and the DNAsequence 

determined by dideoxynucleotide sequencing methodology (Sanger et at [1977] Proc NatL Acad, 
ScL USA 74:5463-5467) using Sequenase (U.S. Biochemicals, Cleveland, OH). DNA sequences 
unique to the CrylF gene were identified by computer comparison with other Cryl genes. An 
oligonucleotide probe with the following sequence was synthesized: 5'- 

35 CCCAATOTGAATGTACnTGCGC.3' (SEQ ID NO. 8). This probe was radiolabeled with 
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and used in standard hybridizations of Southern blots of PS91C2 total cellular DNA. Hybridizing 
bands induded an approximate 13 kbp HinWl fragment 

A gene library was constructed from PS91C2 DNA partially digested with NdeU. Partial 
restriction digests were fractionated by agarose gel electrophoresis. DNA fragments 93 to 23 kbp 
in size were excised from the gel, electroeluted from the gel slice^ purified on an Elutip D ion 
exchange column (Schleicher and Schuell, Keene, NH), and recovered by ethanol precipitation. 
The NdeR inserts were ligated into ^amHI-digested LambdaGem-11 (Promega, Madison, WI). 
Recombinant phage were packaged and plated on EL coli KW251 cells. Plaques were screened by 
hybridization with each of the respeaive probes described above. Hybridizing phage were plaque- 
purified and used to infect liquid cultures of £ coU KW2S1 cells for isolation of DNA by standard 
procedures (Maniatis et aL, supra). 

For subdoning the gene encoding the 130 kDa OylF toxin, preparative amounts of phage 
DNA were digested with Sau3A and electrophoresed on agarose gel. The approximate^ 8 kbp 
band containing the toxin gene was excised from the gel, electroeluted from the gel slice* and 
purified by ion exchange chromatography as described above. The purified DNA insert was 
ligated into an ATtoI-digested pHTBluell (an E. colilB. thuringiensis shuttle veaor comprised of 
pBluescript S/K (Stratagene) and the replication origin from a resident Rt plasmid (D. Lereclus 
et al [1989] FEMS Microbiol Lett. 60:211-218). The ligation mix was used to transform frozen, 
competent £L coli NM522 cells (ATCC 47000). )8-galactosidase transformants were screened by 
restriction digestion of alkaline lysate plasmid minipreps as above. The desired plasmid construct, 
pNfYC2361, contains a toxin gene that is novel compared to other toxin genes containing 
insectiddal proteins. 

pNfYC2361 was introduced into the aciystalliferous (Cry*) Bx host, CryB (A Aronson, 
Purdue Univarsity» West Lafayette, IN) by electroporation. Esqiression of the 130 kDa toxin was 
demonstrated tyy SDS-PAGE analysis. NaBr-purified cxystals were prepared (PDannenstieU MA. 
et al [1984] FEMS Microbiol Lett. 21:39) for determination of toxicity of the cloned gene product 
to Phuella xylostella by the screening method described in Example 3. The LC^ for the CtyJF 
toxin against P. xylostella was determined to be S /4g toxin/ml diet 

Example 5 - Insertion of Toxin Genes Into Plants 

One aspea of the subject invention is the transformation of plants with genes encoding 
a lepidopteran toxin. The transformed plants are resistant to attack by lepidopterans. 

Genes encoding lepidopteran-active toxins, as disclosed herein, can be inserted into plant 
cells using a variety of techniques which are well known in the art. For example, a large number 
of cloning veaors comprising a replication system in E. coil and a marker that permits selection 
of the transformed cells are available for preparation for the insertion of foreign genes into higher 
plants. Thevect rs comprise, for example, pBR322, pUC series, M13mp series, pACyci84, etc 
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Acooidingfy, tbe sequence encoding tlie Rl toxin can be inserted into the vector at a sniuble 
lestricticn site. Tbe residting plasmid is osed for transfonnation into £ coff. TbeRcoUcdis 
are cultivated in a suitable nutrient medium, tben Harvested and lysed. The plasmid b tecoveied. 
Sequence analysis, restriction analysis, electrophoresis, and other bfochemical-molecuiar biological 
methods are generally carried out as methods of analysis. After each manipulation, the DNA 
sequence used can be cleaved and joined to the next DNA sequence. Each plasmid sequence can 
be cloned in the same or other plasmids. Depending on the method of inserting desired genes 
into the plant, other DNA sequences may be necessaiy. If, for example, the Ti or Ri plasmid is 
used for the tiansfoimation of the plant cell, then at least the right border, but often the right and 
the left border of the Ti or Ri plasmid T-DNA, has to be Joined as the flanking region of the 
genes to be inserted. 

TTie use of T-DNA for the transformation of plant cells has been intensive^ researched 
and sufSdently described in EP 0 120 516; Hoekema (1985) In: The Binary Plant Vector System, 
Ofbet-durkkerij Kanters B. V., Albhisserdam. Chapter 5; Fialey etal,Crit. Rev. Plant ScL 4:1-46; 
and An et aL (1985) EMBOJ. 4:277-287. 

Once the inserted DNA has been integrated in the genome, it is relatively stable there 
and, as a rule, does not come out again. It normally contains a selection marker Uiat confers on 
the transformed plant cells resistance to a biodde or an antibiotic, such as kanamydn, G 418, 
bleomycin, hygromycin. or chloramphenicol, inter aUa. The individual^ employed marker should 
accordingly permit the selection of transformed cells raUier than cells diat do not contain the 
inserted DNA. 

A large number of techniques are available for inserting DNA into a plant host cdl. 
Tbose techniques include transformation witii T-DNA using Agrobaaerium tumefadens or 
Agrobaaerium rhizogenes as transformation agent, fusion, injection, or electropoiation as well as 
other possible methods. If agrobacteria are used for the transformation, tiie DNA to be insoted 
has to be cloned into special plasmids. namety either into an intermediate vector or into a binaiy 
vector. The intermediate vectors can be integrated into the Ti or Ri plasmid by homologous 
recombination owing to sequences fliat are homologous to sequences in U»e T-DNA. The Ti or 
Ri plasmid also comprises tiie vir region necessaiy for tiie transfer of the T-DNA. Intermediate 
vectors cannot replicate tiiemseKes in agrobacteria. The intermediate vector can be transferred 
into Apobacterium tumefadens by means of a helper plasmid (conjugation). Binaiy vectors can 
replicate tiiemselves both in £ coB and in agrobacteria. They comprise a selection marker gene 
and a linker or polylinter which are ftamed by tiie right and left T-DNA border regtons. TTiey 
can be transformed directly into agrobacteria (Holsters « aL [1978] MoL Gen. Genet. 163:181-187). 
The agrobaaerium used as host ceU is to comprise a phismid carrying a vir i^on. Hie vir i^on 
is necessaiy for tiie transfer of tiie T-DNA into tiie plant celL Additional T-DNA be 
contained. The bacterium so transformed is used for tiie uansformation of plant cells. Plant 
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explants can advantageously be cultivated with Agrobaaerium tumefadens ot Agrobacienum 
rhiz€»genes for the transfer of the DNA into the plant oelL Whole plants can then be regenerated 
from the infected plant material (for example, pieces of leaf, segments of stalk, roots, but also 
protoplasts or suspension-cultivated cells) in a suitable medium, whidi may contain antibiotics or 
5 biocides for selection. The plants so obtained can then be tested for the presence of the inserted 

DNA. No special demands are made of the plasmids in the case of injection and electroporation. 
It is possible to use ordinary plasmids, such as, for example, pUC derivatives. 

The transfonned cells grow inside the plants in the usual manner. They can form germ 
cells and transmit the transformed trait(s) to progeny plants. Such plants can be grown in the 
1 0 normal manner and crossed with plants that have the same transfonned h^editaiy factors or other 

hereditary fectors. The resulting hybrid individuals have the corresponding phenolic properties. 

Example 6 - Qoninp of Novel B.t Genes Into Insect Viruses 

A number of viruses are known to infect insects. These viruses include, for example, 

1 5 baculoviruses and entomopoxvinises. In one embodiment of the subject invention, lepidopteran- 

active genes, as described herein, can be placed with the genome of the insect virus, thus 
enhancing the pathogenicity of the virus. Methods for constructing insea viruses which comprise 
B,t. toxin genes are well known and readily practiced by those skilled in the art. These procedures 
are described, for example, in Merryweather et al (Merryweather, AT., U. Weyer, MP.G. Harris, 

20 M. Hirst, T. Booth, R.D. Possee [1990] /. Gen. ViroL 7i:1535-1544) and Martens a oL (Martens, 

J.W.M., G. Honee, D. Zuidema, J.W.M. van Lent, B. Visser, J.M. Vlak [1990] AppL Environmental 
Microbiol 56(9)2764-2770). 

It should be understood that the examples and embodiments described herein are for 
25 illustrative purposes only and that various modifications or changes in light thereof will be 

suggested to persons skilled in the art and are to be included within the spirit and purview of this 
application and the scope of the appended claims. 
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SEQUENCE LISTING 



(1) GEMERAI. INFORMATION: 



<i) APPLICANT NAME(S): 
STREET ADDRESS: 
CITY: 

STATE/PROVINCE : 
COUNTRY: 
POSTAL CODE /ZIP: 
PBONE NUMBER: 
FAX NUMBER: 



MYCOGEN CORPORATION 
5501 oberlin Drive 
San Diego 
California 

US 

92121 

(619) 453-8030 
(619) 453-6991 



(ii) TITLE OF INVENTION: Protein Toxins Active Against Lepidopteran Pests 
(iii) NUMBER OF SEQX7SNCES: 27 

(iV) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Saliwanchik & Saliwanchik 

(B) STREET: 2421 N.W. 41st Street, Suite A-1 

(C) CITY: Gainesville 

(D) STATE: FL 

(E) COUNTRY: US 

(F) ZIP: 32606 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release «1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/597,607 

(B) FILING DATE: 15-OCT-90 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Saliwanchik, David R. 

(B) REGISTRATION NUMBER: 31,794 

(C) REFERENCE /DOCKET NXAIBER: MA50.C1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (904)375-8100 

(B) TELEFAX: (904)372-5800 



(2) INFORMATION FOR SEQ ID NO:l8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI •SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Bacillus ^urlnglenais 

(B) STRAXN: aizavai 

(C) ZNDZVZDUAI. ISOLATE: PS81A2 

(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Lambdagem - 11 (tm) Library of August Sick 

(B) CLONE: 81A2 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATGGAGAATA ATATTGAAAA TCAATGCATA CCTTACAATT GTTTAAATAA TCCTGAAGTA 60 

GAGATATTAG GGATTGAAAG GTCAAATAGT AACGTAGCAG CAGAAATCGG CTTGGGGCTT 120 

AGTCGTCTGC TCGTTTCCCG AATTCCACTA GGGGATTTTA TACTTGGCTT GTTTGATGTA 180 

ATATGGGGGG CTATAGGTCC TTCACAATGG GATATATTTT TAGAGCAAAT TGAGCTATTG 240 

ATCGGCCAAA GAATAGAGGA ATTCGCTAGG AATCAGGCAA TTTCTAGATT ACAAGGGCTA 300 

AGCAATCTTT ACCGAATTTA CACAAATGCT TTTAAAAACT GGGAAGTAGA TCCTACTAAT 360 

CCAGCATTAA GAGAAGAGAT GCGTATTCAA TTTAATGACA TGAACA6TGC TCTTACAACA 420 

GCTATTCCTC TTTTTTCAGT TCAAGGTTAT GAAATTCCTC TTTTATCAGT ATATGTTCAA 480 

GCTGCAAATT TACATTTATC GGTTTTGAGA GATGTTTCAG TGTTTGGACA ACGTTGGGGA 540 

TTTGATGTAG CAACAATCAA TAGTCGTTAT AATGATTTAA CTAGGCTTAT TGGCGAATAT 600 

ACTGATTATG CTGTACGTTG GTATAATACG GGGTTAAATC GTTTACCACG TAATGAAGG6 660 

GTACGAGGAT GGGCAAGATT TAATAGGTTT AGAAGAGAGT TAACAATATC AGTATTAGAT 720 

ATTATTTCTT TTTTCCAAAA TTACGATTCT AGATTATATC CAATTCCGAC AATCTATCAA 780 

TTAACGCGGG AAGTATATAC AGATCCGGTA ATTAATATAA CTGATTATAG AGTTACCCCA 840 

AGTTTCGAGA GTATTGAAAA TTCAGCTATT AGAAGTCCCC ATCTTATGGA TTTCTTAAAT 900 

AATATAATTA TTGACACTGA TTTAATTAGA GGCGTTCACT ATTGGGCGGG 6CATCGTGTA 960 

ACTTCTCATT TTACCGGTAG TTCGCAAGTG ATAAGCTCCC CTCAATACGG GATAACTGCA 1020 

AACGCAGAAC CGAGTCGAAC TATTGCTCCT AGCACTTTTC CAGGTCTTAA TCTATTTTAT 1080 

AGAACACTAT CAGACCCTTT CTTCCGAAGA TCCGATAATA TTATGCCAAC ATTAGGAATA 1140 

AATGTAGTGC AGGGGGTAGG ATTCATTCAA CCAAATAATG GTGAAGTTCT ATATAGAAGG 1200 

AGAGGAACAG TAGATTCTCT TGATGAGTTG CCAATTGACG GTGAGAATTC ATTAGTTGGA 1260 

TATAGTCATA GATTAAGTCA CGTTACATTA ACCAGGTCGT TATATAATAC TAATATAACT 1320 

AGCTTGCCAA CATTTGTTTG GACACATCAC AGTGCTACTG ATC6AAATAT AATCTATCCG 1380 

GATGTAATTA CACAAATACC ATTGGTAAAA TCATTCTCCC TTACTTCA6G TACCTCTGTA 1440 

GTCAGAGGCC CAGGATTTAC AGGAGGGGAT ATCATCCGAA CTAACGTTAA TGGTAATGTA 1500 

CTAAGTATGA GTCTTAATTT TAGTAATACA TCATTACAGC GGTATCGCGT GAGAGTTCGT 1560 

TATGCTGCTT CTCAAACAAT GGTCATGAGA GTAAATGTTG GAGGGAGTAC TACTTTTGAT 1620 

CAAGGATTCC CTAGTACTAT GAGTGCAAAT GGGTCTTTGA CATCTCAATC ATTTAGATTT 1680 
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GCAGAATTTC 


CTGTAGGCAT 


TAGTACATCT 


GGCAGTCAAA 


CTCCTGGAAT 


AA6TATAAGT 


1740 


A&TAATCCA6 


GTAGACAAAC 


6TTTCACTTA 


GATAGAATTG 


AATTTATCCC 


A6TTGATGCA 


1800 


ACATTTGAAG 


CAGAATATGA 


TTTAGlUkAGA 


GCACAAAAGG 


CGGTGAATTC 


GCTGTTTACT 


1860 


TCTTCCAHTC 


AAATCGAGTT 


AAAAACAGAT 


GTGACGGATT 


ATCATATTGA 


TCAAGTATCC 


1920 


AA.TTTA6TA6 


ATTGTTTATC 


CGATGAATTT 


T6TCTGGATG 


AAAAGCGAGA 


ATTGTCCGAG 


1980 


AAACTCAA&C 


ATGCGAAGCG 


ACTCAGTGAT 


GAGCGGAATT 


TACTTCAAGA 


TCCAAACTTC 


2040 


A6AGGGATCA 


ATAGGCAHCC 


AGACCGTGGC 


TGGAGAGGAA 


GTACGGATAT 


TACCATCCAA 


2100 


GGAGGAGAT6 


ACGTA7TCAA 


ACAGAATTAC 


GTCACACTAC 


CAGGTACCTT 


TGATGAGTGC 


2160 


TA<rCCAAC6T 


ATTTGTATCA 


AIUkAATAGAT 


GA6TCGAAAT 


TAAAAGCCTA 


TAACCGTTAC 


2220 


GAATTAAGAG 


GGTATATCGA 


A6ATAGTCAA 


GACTTAGAAA 


TCTATTTAAT 


TCGCTACAAT 


2280 


GCAIUU^CACG 


AAACAGTAAA 


TGTACCAGGT 


ACGGGTTCCT 


TATGGCCGCT 


TTCAGTCGAA 


2340 


AGTCCAATTG 


GAAGGT6TGG 


AGAACCGAAT 


CGGTGTGTGC 


CACACCTTGA 


ATGGAATCCT 


2400 


GATTTAGATT 


GTTCCTGCAG 


AGACGGGGAA 


AAATGTGCAC 


ATCATTCCCA 


TCATTTCTCC 


2460 


TTCGACATTG 


ATGTTCGATG 


CACAGACTTG 


CAAGAGGATC 


TAGGCGTG7G 


GGTTGTATTC 


2520 


AAGATTAAGA 


C6CAGGAAGG 


TTATGCAAGA 


TTAGGAAATC 


7GGAATTTAT 


CGAAGAGAAA 


2580 


CCATTAATTG 


GAGAAGCACT 


GTCTCGTGTG 


AAGAGAGCGG 


AAAAAAIkATG 


GA6AGACAAA 


2640 


CGGGAAAAAC 


TACAATTGGA AACAAAACGA 


GTATATACAG 


AGGCAAA2kGA 


AGCTGTGGAT 


2700 


GCTTTATTCG 


TAGATTCTCA ATATGATAGA 


TTACAAGCAG 


ATACAlkAGAT 


TGG7AT6ATT 


2760 


CATGCGGCAG 


ATAGACTTGT 


TCATCAGATC 


CACGAGGCTT 


ATCTTCCAGA 


ACTACCTTTC 


2820 


ATTCCAGGAA 


TAAATGTGGT 


GATTTTTGAA 


GAATTAGAAA 


ACCGTATTTC 


TAC7GCATTA 


2880 


TCCCTATATG 


ATGCGAGAAA 


TGTCATTAAA 


AATGGCGATT 


TCAATAATGG 


CTTATCATGC 


2940 


TGGAACGTGA 


AAGGGCATGT 


AGATGTAGTA 


GAACAAAACA 


ACCACCGTTC 


GGTCCTTGTT 


3000 


6TCCCGGAAT 


GGGAAGCAGA AGTCTCACAA 


AGAATTCGTG 


TCTGTCCGGG 


GCGTGGCTAT 


3060 


ATCC7CCGTG 


TTACAGCGTA 


CAlkAGAGGGA 


TATGG2USAAG 


GTTGCGTAAC 


CATCCATGAG 


3120 


ATCGAGAACA 


ATACAGACGA ACTAAAATTT 


AAAJU^CTGTG 


AAGAA6AGGA 


AGTGTATCCA 


3180 


ACGGATACAG 


GAACGTGTAA 


TGATTATACT 


6CACACCAAG 


GTACAGCAGG 


ATCCACAGAT 


3240 


TCATGTAATT 


CCCGTAATAT 


CA6ATATGAG 


GATGCATATG 


AAATGAATAC 


TACA6CATCT 


3300 


GTTAATTACA 


AACCGACTTA 


CGAAGAAGAA 


AGGTATACAG 


ATGTACAAGG 


AGATAATCAT 


3360 


TCTGAATATG 


ACAGAGGGTA 


T6TCAATTAT 


CGACCAGTAC 


CAGCTGGTTA 


T6TGACAAAA 


3420 


GAATTAGAQT 


ACTTCCCAGA AACCGATAAG 


GTATGGATTG 


A6ATCGGAGA 


AACGGAAGGG 


3480 


AAGT7TATTG 


TAGACAATGT 


CGIkATTACTC 


CTTATGGAGG 


AA 




3522 



(2) INFORMATION FOR SSQ ID NOs2: 

(i) SEQUENCE CHARACTERISTICS! 

(A) I.SNGTBS 1174 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDHESS: single 

(D) TOPOLOGY: linear 

(ii) MOUCUiiE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<iv) ANTI -SENSES NO 

<vi) ORIGINAL SOURCE: 

(A> ORGANISM: Bacillus thuringiensis 

(B) STRAIN: aizawai 

(C) INDIVIDUAL ISOLATE: PS81A2 

(Vii) IMMEDIATE SOURCE: 

i^l^^i.^f^"^^^ " <^> ^^rary of August sick 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOt2t 

Met Glu Asn Asn lie Glu Aan Oln Cys lie Pro Tyr Asn Cys l^u Aan 

•^5 10 15 

Asn Pro Glu val Glu lie Leu Gly lie Glu Arg ser Asn Ser Asn V«l 
20 25 30 

Ala Ala Glu lie Gly Leu Gly Leu Ser Arg Leu Leu Val Ser Arg He 

35 40 45 

Pro Leu Gly Asp Phe He Leu Gly Leu Phe Asp Val He Trp cly Ala 
50 55 60 

lie Gly Pro Ser Gin Trp Asp He Phe Leu Glu Gin He Glu Leu Leu 
®^ 70 75 80 

He Gly Gin Arg He Glu Glu Phe Ala Arg Asn Gin Ala He Ser Arg 
85 90 ^5 

Leu Gin Gly Leu ser Asn Leu Tyr Arg He Tyr Thr Asn Ala Phe Lys 

Asn Trp Glu val Asp Pro Thr Asn Pro Ala Leu Arg Glu Glu Met Ara 
115 120 125 

■^'^ Ala He Pro Leu 

135 

Phe ser Val Gin Gly Tyr Glu He Pro Leu Leu Ser Val Tyr Val Gin 

"0 155 160 

Ala Ala Asn Leu His Leu ser Val Leu Arg Asp Val ser Val Pha Gly 
165 170 175 

Gin Arg Trp cly Phe Asp Val Ala Thr He Asn ser Arg Tyr Asn Asp 
Leu Thr Arg Leu He Gly Glu Tyr Thr Asp Tyr Ala Val Arg Trp Tyr 

Asn Thr Gly Leu Asn Arg Leu Pro Arg Asn Glu Gly val Arg Gly Tro 

215 220 

Ala Arg Phe Asn Arg Phe Arg Arg Glu Leu Thr He s r Val Leu Asd 

230 235 240 
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lie Xle ser Phe Phe Gin Asn Tyr Asp S r Arg Leu Tyr Pro lie pro 

245 250 255 

Thr lie Tyr Gin IfCU Thr Arg Glu Val Tyr Thr Asp Pro val He Asn 
260 265 270 

He Thr Asp Tyr Arg val Thr Pro ser Phe Glu Ser He Glu Asn Ser 

275 280 285 

Ala He Arg ser Pro Bis Z<eu Met Asp Phe Leu Asn Asn He He lie 
290 295 300 

Asp Thr Asp Leu He Arg Gly val His Tyr Trp Ala Gly His Arg Val 
305 310 315 320 

Thr ser His Phe Thr Gly Ser Ser Gin Val He Ser Ser Pro Gin Tyr 
325 330 335 

Gly He Thr Ala Asn Ala Glu Pro Ser Arg Thr He Ala Pro Ser Thr 
340 345 350 

Phe Pro Gly Leu Asn Leu Phe Tyr Arg Thr Leu ser Asp Pro Phe Phe 
355 360 365 

Arg Arg ser Asp Asn He Met Pro Thr Z«eu Gly He Asn Val Val Gin 
370 375 380 

Gly Val Gly Phe He Gin Pro Asn Asn Gly Glu Val Leu Tyr Arg Arg 
385 390 395 400 

Arg Gly Thr Val Asp ser Leu Asp Glu Leu Pro He Asp Gly Glu Asn 
405 410 415 

ser Leu Val Gly Tyr Ser His Arg Leu ser His Val Thr Leu Thr Arg 
420 425 430 

ser Leu Tyr Asn Thr Asn He Thr ser Leu Pro Thr Phe Val Trp Thr 
435 440 445 

His Bis ser Ala Thr Asp Arg Asn He xle Tyr Pro Asp val He Thr 
450 455 460 

Gin lie Pro Leu val Lys ser Phe Ser Leu Thr Ser Gly Thr Ser Val 
465 470 475 480 

Val Arg Gly Pro Gly Phe Thr Gly Gly Asp He He Arg Thr Asn Val 
485 490 495 

Asn Gly Asn val Leu Ser Met ser Leu Asn Phe Ser Asn Thr Ser Leu 
500 505 510 

Gin Arg Tyr Arg val Arg Val Arg Tyr Ala Ala ser Gin Thr Met Val 

515 520 525 

Met Arg Val Asn val Gly Gly ser Thr Thr Phe Asp Gin Gly Phe Pro 
530 535 540 

Ser Thr Met Ser Ala Asn Gly Ser Leu Thr Ser Gin ser Phe Arg Phe 
545 550 555 560 

Ala Glu Phe Pro Val Gly H Ser Thr ser Gly ser Gin Thr Ala Gly 
565 570 575 

He S r He Ser Asn Asn Pr Gly Arg Gin Thr Phe His Leu Asp Arg 
580 585 590 
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lie Glu Phe lie Pro Val Asp Ala Thr Phe oiu Ala Glu Tyr Asp Jma 

5»5 600 605 

Glu Arg Ala Gin Lys Ala Val Asn ser Lan phe Thr Ser Ser Asn Gin 
•lO 615 620 

lie Glu Leu tys Thr Asp Val Thr Asp Tyr Bis lie Asp Gin Val Ser 

"5 "0 635 

Asn Leu val Asp cys Leu ser Asp Glu Phe cys Leu Asp Glu Lvs Aro 
"5 650 655 

Glu Leu ser Glu Lys Val Lys Hi. Ala Lys Arg Leu Ser Asp Glu Arg 

665 670 

Asn Leu Leu Gin Asp pro Asn Phe Arg Gly lie Asn Arg Gin Pro Asp 

680 685 

Arg Gly Trp Arg Gly ser Thr Asp He Thr He Gin Gly Gly Asp Asp 

695 700 J F e 

val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr pbe Asp Glu Cys 

710 715 720 

Tyr Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys Ala 

'25 730 735 

Tyr Asn Arg Tyr Gin Leu Arg Gly Tyr He Glu Asp ser Gin Asp Leu 

Glu He Tyr Leu He Arg Tyr Asn Ala Lys His olu Thr val Asn val 

760 765 

Pro Gly Thr oly ser Leu Trp Pro Leu ser Val Glu ser Pro He Gly 

775 780 

Arg cys Gly Glu Pro Asn Arg Cys Val Pro His Leu Glu Trp Asn Pro 

795 800 
Asp Leu Asp cys Ser cys Arg Asp Gly Glu Lys Cys Ala His ais Ser 

810 815 

His Bis Phe ser Leu Asp He Asp val Gly cys Thr Asp Leu Gin Glu 

825 830 

Asp Leu Gly val Trp Val Val Phe Lys He Lys Thr Gin Glu Gly Tyr 

840 845 "* ' 

^* f^t f I^u He Gly 

855 860 

Glu Ala Leu ser Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys 

Arg Glu Lys Leu Gin Leu olu Thr Lys Arg val Tyr Thr Glu Ala Lys 

885 890 895 • 

Glu Ala val Asp Ala Leu Phe Val Asp ser Gin Tyr Asp Arg Leu Gin 

Ala Asp Thr Asn He Gly Met He His Ala Ala Asp Arg Leu val His 

. 920 925 

930 ""^^ ^* «S »~ Gly He 

935 940 
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Asn Val val lie Phe Glu Glu Zieu Glu Asn Arg He s r Thr Ala JsBXX 
945 950 955 960 

ser I«eu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn Asn 
965 970 975 

Gly Leu ser cys Trp Asn Val Lys Gly His Val Asp Val Val Glu Gin 
980 985 990 

Asn Asn His Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val 
995 1000 1005 

Ser Gin Thr He Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val 

1010 1015 1020 

Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He Bis Glu 
1025 1030 1035 1040 

Zle Glu Asn Asn Thr Asp Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu 
1045 1050 1055 

Glu Val Tyr Pro Thr Asp Thr Gly Thr Cys Asn Asp Tyr Thr Ala Bis 
1060 1065 1070 

Gin Gly Thr Ala Gly ser Thr Asp Ser Cys Asn Ser Arg Asn He Arg 
1075 1080 1085 

Tyr Glu Asp Ala Tyr Glu Me^ Asn Thr Thr Ala ser Val Asn Tyr Lys 
1090 1095 1100 

Pro Thr Tyr Glu Glu Glu Arg Tyr Thr Asp Val Gin Gly Asp Asn His 
1105 1110 1115 1120 

cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr Arg Pro Val Pro Ala Gly 
1125 1130 1135 

Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp 
1140 1145 1150 

He Glu He Gly Glu Thr Glu Gly Lys Phe He Val Asp Asn Val Glu 
1155 1160 1165 

Leu Leu Leu Me-t Glu Glu 
1170 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 3504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECTXLB TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus ^huringiensis 

(B) STRAIN: Morrissoni 

(C) INDIVIDUAL ISOLATE: PS91C2 



(vii) IMMEDIATE SOURCE: 

<A) LIBRARY: LainbdaGeai (TM) - 11 Library f Ter sa 
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ThonpBon 
(B) CI.ONE: 91C2 



<xi) SEOUENCE DESCRXPTION: SEQ ZD NO: 2: 








ATG2UU3AA.TA 


ACATTCAAAA 


TCAATGCGTA 


CCTTACAATT 


GTTTAAGTAA 


TCCTGAAGTA 


60 


GJUVATATTAA 


GTGAAGAAAG 


AAGTAGTGGC 


AGATTACCGT 


TAGATATATC 


CTTGTCGCTT 


120 


ACAC6TTTCC 


TTTTGAGTGA ATTTGTTCGA 


GGTGTGGGAG 


TTGCGTTTGG 


ATTATTTGAT 


180 


TOIAATATGGG 


GTTTTA7AAC 


TCCTTCTGAA 


TGGAGTTTAT 


TTCTTTTACA 


GATTGAACAA 


240 


CTGATT6AAC 


AAAGAATT6A 


AACATTGGAA 


AGGAACCGGG 


CAATTACTAC 


ATTACGAGGG 


300 


TTAGCGGATA 


GCTATGAAGT 


TTACCTTGAG 


GCACTAAGAG 


AGTGGGAAGA AAATCCTAAT 


360 


AAT6CACAAT 


TAAGGGAAGA 


TGTGCGTATT 


CGATTTGCTA 


ATACAGACGA 


CGCTTTAATA 


420 


ACA6CAATAA 


ATAATTTTAC 


ACTTACAAGT 


TTTGAAATCC 


CTCTTTTATC 


GGTCTATGTT 


480 


CAA6CGGCGA 


ATCTACATTT 


ATCACTATTA 


AGAGATGCTG 


TATCGTTTCG 


GCAGGGTTGG 


540 


G6GCTGGATA 


TAGCTACTGT 


TAATAATCAT 


TATAATAGAT 


TAATAAATCT 


TATTCATAGA 


600 


TATACGGAAC 


ATTGTTTGGA 


CACATACAAT 


CAA6GATTAG 


AAAACTTAAG 


AGGTACTAAT 


660 


ACTCGACAAT 


GGTCAAGATT 


CAATCAGTTT 


AGGAGAGAGT 


TAACATTGAC 


TGTATTAGAT 


720 


ATCGTTGCTC 


TTTTTCCGAA 


CTACGATGCT 


AGAGCATATC 


CAATTCAJkAC 


GTCATCCCAA 


780 


TTAACAAGGG 


AAATTTATAC 


AAGTTCAGTA 


ATTGAAGATT 


CTCCAGTTTC 


TGCTAATATA 


840 


CCTAATGGTT 


7TAATAGAGC 


GGAATTTGGA 


GTTAGACCGC 


CCCATCTTAT 


GGACTTTATG 


900 


AATTCTTTGT 


TTGTAACTGC 


AGAGACTGTT 


AGAA6TCAAA 


CTGTGTGGGG 


AGGACACTTA 


960 


GTTAGTTCAC 


GAAATACGGC 


TGGTAACCCT 


ATAAHTTTCC 


CTATTTATGG 


GGTCTTCAAT 


1020 


CCTG6TGGCG 


CCATTTGGAT 


TGCAGATGAG 


GATCCACGTC 


CTTTTTATCG 


GACATTATCA 


1080 


GATCCTGTTT 


TTGTCCGAGG 


AGGATTTGGG 


GATCCTCATT 


ATGTACTTGG 


GCTTAGGGGA 


1140 


GTAGGATTTC 


AACAAACTGG 


TACGAACCAC 


ACCCGAACAT 


TTAGAAATAG 


TGGGACCATA 


1200 


GATT-CTCTAG^ATGAAATCCC ACCTCAGGAT AATAGTGGCe gAgg«P«yv?<yftR 


-SGATTATA6T -. 


—1260 


CATGTATTAA ATCATGTTAC 


ATTTCTAAGG 


TGGCCTGGTG AGATTGCAGG AAGTGATTCA 


1320 


TGGAGAGCGC 


CAATGTTTTC 


TTGGACACAC 


CGTAGTGCAG 


ATCGTACAAA 


TATCATXAAT 


1380 


CCAAATATAA TTACACAAAT ACCTGCTGTA AiUVGCACACA ATCTTCATTC 


GGGTTCTAOG 


1440 


GTTGTTAGAG 


GACCCGGGTT 


TACAGGTGGT 


GATCTCTTAC 


GIIAGAACGAA 


TACTGGTACA 


1500 


TTTGCAGATA 


TAAGAGTAAA 


TATTACTGGG 


CCATTATCTC 


AAAGATATCG 


TGTAAGAATT 


1560 


CGCTATGCTT 


CTACGACAGA 


TTTACAATTT 


TTCACGAGAA 


TCAATGGAAC 


TTCTGTAAAT 


1620 


CAAGGTAATT 


TCCAAAGAAC 


TATGAATAGA 


GGGGATAATT 


TAGAATCTGG 


AAACTTTAGG 


1680 


ACTGCAGGAT 


TTAGTACGCC 


TTTTAGTTTT 


TCAAATGCGC AAAGTACATT CACATTGGGT 


1740 


ACTCAGGCTT 


TTTCAAATCA 


GGAAGTTTAT 


ATAGATCGAA 


TTGAATTTGT 


CCCGGCAGAA 


1800 


GTAACATTCG 


AGGCAGAATC 


TGATTTAGAA 


AGAGCGCAAH AGGCGGTGAA TGCCCTGTTT 


1860 
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ACnCTACAA 


GCCAACTA6G 


6CTAAAAACA 


AATGTAACGG 


GTTACCATAT 


TGATCAAGTG 


1920 


TCCAATTTAG 


TTGCGTGTTT 


ATCGGATGAA 


TTTTGTCTGG 


ATGAAAAGAG 


AGAATTGTCC 


1980 


GAGAAA6TTA 


AACATGCGAA 


GCGACTCAGT 


GATAAGCG6A 


ATTTACTTCA AGATCCAAAC 


2040 


TTCJU3AGGGA 


TCAATAGGCA 


ACCAGACCAT 


GGCTGGAGAG 


GAAGTACGGA 


TATTACTATC 


2100 


CAAGGAGGAG 


ATGACGtTATT 


CAAAGAGAAT 


TACGTTACGC 


TACCGGGTAC 


TTTTGATGAG 


2160 


TGCTATCCAA 


CGTATTZATA 


TCAJkAAAATA 


GATGAGTCGA 


AATTAAAAGC 


CTATACCCGT 


2220 


TATCAATTAA 


GAGGGTATAT 


CGAAGATAGT 


CAAGACTTAG 


AlkATCTATTT 


AATTCGTTAC 


2280 


AATTCAAAAC 


ACGAAATAGT 


AlUlTGTACCA 


GGTACAGGGA 


GTTTATGGCC 


TCTTTCTGTA 


2340 


GAAAATCAAA 


TTGGACCTTG 


TGGAGAACCG 


AATCGATGCG 


CGCCACACCT 


TGAATGGAAT 


2400 


CCTGATTTAC 


ACTGTTCCTG 


CAGAGACGGG 


GAIUUUITGTG 


TGCATCATTC 


TCATCATTTC 


2460 


TCTTTGGACA 


TTGATGTCGG 


ATGTACAGAT 


TTAAATGAGG 


ACCTAGGTGT 


ATGGTTGATA 


2520 


TTCAAGATTA 


A6ACGCAAGA 


TGGCCACGCA 


AGACTAGGGA 


ATCTAGAGTT 


TCTCGAAGAG 


2580 


GAACCGTTAT 


TAGGCGAAGC 


GTTAGGACGT 


GTGAJkGAGAG 


CGGAGAAGAA 


GTGGAGAGAC 


2640 


AAACGCGAGA 


AACTGCAGTT 


GGAAACAAAT 


ATTGTCTATA 


AAGAGGCAAA AGiVATCTG7A 


2700 


GAT6CTTTAT 


TTGTAAACTC 


TCAATATGAT 


AGATTACAAG 


CGGATACGAA 


CATCGCGATC 


2760 


ATTCATGCGG 


CAGATAAACG 


C6TTCATAGA 


ATCCGGGAAG 


CGTATCTGCC 


AGAGTTGTCT 


2820 


GTGATTCCAG 


GTGTCAATGC 


GGCCATTTTC 


GAAGAATTAG 


AGGGACGTAT 


T7TTACAGCG 


2880 


TATTCCTTAT 


ATGATGCGAG 


AAATGTTATT 


A2UUUITGGCA 


ATTTCAATA2W TGGCjTTATTA 


2940 


TGCTGGAACG 


TGAAAGGGCA 


TGTAGATGTA 


GAAGAGCAAA 


ACAACCACCG 


TTCGGTCCTT 


3000 


6TTGTTCCGG 


AATGGGAAGC 


AGAAGTGTCA 


CAAGAA6TTC 


GTGTCTGTCC 


GGGTCGTGGC 


3060 


TATATCCTTC 


GTGTCACAGC 


GTACAAAGAG 


GGATATGGAG 


AAGGCTGCGT 


AACTATTCAT 


3120 


GAAGTCGATA 


ATAATACAGA 


CGAATTGAAG 


TTTAGCAACT 


GTGAGAAAGA ACAAGTATAT 


3180 


CCAGGTAATA 


CGGTAGCAT6 


TAATGATTAT 


AATAAGAHTC 


ACGGTGOGAA 


TGCATGTAGT 


3240 


TCTCGTAATC 


GTG6ATAT6A 


CGAATCTTAT 


GAAAGTAATT 


CTTCCATACC 


AGCTGATTAT 


3300 


GGACCGGTTT 


ATGAAGAAGA 


AGCGTATACA 


GATOGACAAA 


GAGGGikATCC 


TTGTGAATTT 


3360 


AACAGAGGGC 


ATACACCATT 


ACCAGCTGGT 


TATGTGACAG 


CAGAGTTAG2W 


GTACTTCCCA 


3420 


GAimCGGATA 


CAGTATGGGT 


TGAGATTCGA 


GAAACGGAAG 


GAACATTTAT 


CGTGGACAGT 


3480 


6TGGAATTAC 


TCCTTATGGA 


GGAA 








3504 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1168 amino acids 
(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
(D) TOPOZ^Y: linear 

(ii) MOLECULE TYPE: protein 
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(111) HYPOTBETICAI.3 YES 
(iv) ANTX-SENSEs NO 

(Vi) ORIGINAI. SOURCES 

(A) ORGANISM: BACILLUS THURINGIENSIS 

(B) STRAIN: Morrissoni 

(C) INDIVIDUAL ISOLATE: PS91C2 

(Vii) IMMEDIATE SOURCE: 

(A> LIBRARY: LAMBDAGEM (TM) - 11 LIBRARY OF TERESA 

THOMPSON 
(B) CLONE: 91C2 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Asn Asn He Gin Asn Gin cys Val Pro Tyr Asn Cya Leu Ser 
1 5 10 15 

Asn Pro Glu Val Glu He Leu Ser Glu Glu Arg ser Thr Gly Arg Leu 
20 25 30 

Pro Leu Asp He Ser Leu Ser Leu Thr Arg Phe Leu Leu ser Glu Phe 
35 40 45 

Val Pro Gly val Gly Val Ala Phe Gly Leu Phe Asp Leu He Trp Gly 
50 55 60 

Phe He Thr Pro ser Glu Trp ser Leu Phe Leu Leu Gin He Glu Gin 
«5 70 75 80 

Leu He Glu Gin Arg He Glu Thr Leu Glu Arg Asn Arg Ala He Thr 
85. 90 95 

Thr Leu Arg Gly Leu Ala Asp Ser Tyr Glu Val Tyr Leu Glu Ala Leu 

100 105 110 

Arg Glu Trp Glu Glu Asn Pro Asn Asn Ala Gin Leu Arg Glu Aso Val 
115 120 125 

Arg He Arg Phe Ala Asn Thr Asp Asp Ala Leu He Thr Ala He Asn 

130 135 140 

Asn Phe Thr Leu Thr Ser Phe Glu He Pro Leu Leu ser Val Tyr val 
145 150 155 160 

Gin Ala Ala Asn Leu Bis Leu Ser Leu Leu Arg Asp Ala val Ser Phe 

165 170 175 

Gly Gin Gly Trp Gly Leu Asp He Ala Thr Val Asn Asn Bis Tyr Asn 
180 185 190 

Arg Leu He Asn Leu He Bis Arg Tyr Thr Glu His cys Leu Asd Thr 
195 200 205 

Tyr Asn Gin Gly Leu Glu Asn Leu Arg Gly Thr Asn Thr Arg Gin Trp 
210 215 220 

ser Arg Phe Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu Asd 
225 230 235 240 

He Val Ala Leu Phe Pro Asn Tyr Asp Ala Arg Ala Tyr Pro He Gin 
245 250 255 
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Thr ser ser oln l.eu Thr Arg Glu lie Tyr Thr s r Ser val lie Glu 
260 265 270 

ABp ser Pro Val Ser Ala Asn lie Fro Asn Gly Phe Asn Arc Ala Glu 
275 280 285 

Phe Gly val Arg Pro Pro Bis Leu Met Aap Phe Met Asn Ser Leu Phe 
290 295 300 

val Thr Ala Glu Thr Val Arg Ser Gin Thr val Trp Gly Gly His Leu 
305 310 315 320 

Val ser Ser Arg Asn Thr Ala Gly Asn Pro lie Asn Phe Pro He Tvr 
325 330 335 ^ 

Gly Val Phe Asn Pro Gly Gly Ala He Trp He Ala Asp Glu Asp Pro 
340 345 350 

Arg Pro Phe Tyr Arg Thr Leu Ser Asp Pro Val phe Val Ara Glv Glv 

355 360 365 

Phe Gly Asp Pro His Tyr Val Leu Gly Leu Arg Gly val Gly Phe Gin 
370 375 380 

Gin Thr Gly Thr Asn His Thr Arg Thr Phe Arg Asn ser Gly Thr He 
385 390 395 ' 

Asp ser Leu Asp Glu He Pro Pro Gin Asp Asn Ser Gly Ala Pro Tro 
405 410 415 ^ 

Asn Asp Tyr Ser His Val Leu Asn His Val Thr Phe Val Arc Tro Pro 
420 425 430 

Gly Glu He Ala Gly ser Asp Ser Trp Arg Ala Pro Met Phe Ser Trp 
435 440 445 ^ 

Thr His Arg ser Ala Asp Arg Thr Asn He He Asn Pro Asn He He 
450 455 460 

Thr Gin He Pro Ala Val Lys Ala His Asn Leu His ser Gly Ser Thr 
465 470 475 480 

val Val Arg Gly Pro Gly Phe Thr Gly Gly Asp Leu Leu Arg Arg Thr 
485 490 495 

Asn Thr Gly Thr Phe Ala Asp He Arg Val Asn He Thr Gly Pro Leu 
500 505 510 

ser Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asp Leu 

515 520 525 

Gin Phe Phe Thr Arg He Asn Gly Thr Ser Val Asn Gin Gly Asn Phe 
530 535 540 

Gin Arg Thr Met Asn Arg Gly Asp Asn Leu Glu Ser Gly Asn Phe Aro 
545 550 555 S60 

Thr Ala Gly Phe Ser Thr Pro Phe Ser Phe Ser Asn Ala Gin Ser Thr 
565 570 575 

Phe Thr Leu Gly Thr Gin Ala Phe Ser Asn Gin Glu Val Tyr He Asp 
580 585 590 

Arg He Glu Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Ser Asp 
595 600 605 
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Z.eu Glu Arg Ala Gin liys Ala Val Asn Ala Xieu Phe Thr Ser Thr Ser 

610 615 620 

Gin Xfeu Gly Leu Lys Thr Asn Val Thr Gly Tyr His He Aap Gin Val 
625 630 635 640 

ser Aan Leu Val Ala Cys Leu ser Asp Glu Phe cys Leu Asp Glu Lys 
645 650 655 

Arg Glu Leu Ser Glu Lys Val Lys Bis Ala Lys Arg Leu Ser Asp Lys 
660 665 670 

Arg Asn Leu Leu Gin Asp Pro Asn Phe Arg Gly lie Asn Arg Gin Pro 

675 680 685 

Asp His Gly Trp Arg Gly ser Thr Asp Xle Thr He Gin Gly Gly Asp 
690 695 700 

Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu 

705 710 715 720 

Cys Tyr Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu ser Lys Leu Lys 
725 730 735 

Ala Tyr Thr Arg Tyr Gin Leu Arg Gly Tyr He Glu Asp ser Gin Asp 
740 745 750 

Leu Glu He Tyr Leu He Arg Tyr Asn Ser Lys His Glu He Val Asn 
755 760 765 

Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Val Glu Asn Gin He 
770 775 780 

Gly Pro cys Gly Glu Pro Asn Arg Cys Ala Pro Bis Leu Glu Trp Asn 
785 790 795 800 

Pro Asp Leu His Cys ser Cys Arg Asp Gly Glu Lys cys Val His Bis 

805 810 815 

Ser His Bis Phe Ser Leu Asp He Asp Val Gly cys Thr Asp Leu Asn 
820 825 830 

Glu Asp Leu Gly Val Trp Leu He Phe Lys He Lys Thr Gin Asp Gly 
835 840 645 

His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Glu Pro Leu Leu 
850 855 860 

Gly Glu Ala Leu Gly Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp 
865 870 875 880 

Lys Arg Glu Lys Leu Gin Leu Glu Thr Asn xle Val Tyr Lys Glu Ala 
885 890 895 

Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Arg Leu 

900 905 910 

Gin Ala Asp Thr Asn He Ala Met He His Ala Ala Asp Lys Arg Val 
915 920 925 

His Arg He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly 
930 935 940 

Val Asn Ala Ala Xle Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala 
945 950 955 960 
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Tyr ser I.eu Tyr Asp Ala Arg Asn Val Zle l^ys Asn Gly Aan Phe Asn 
965 970 975 

Asn Gly lieu I.eu cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu 
980 985 990 

Gin Asn Asn Bis Arg Ser Val Leu Val val Pro Glu Trp Glu Ala Glu 
995 1000 1005 

Val ser Gin Glu Val Arg val cys Pro Gly Arg Gly Tyr Zle I.eu Arc 
1010 1015 1020 

Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly cys Val Thr lie His 
1025 1030 1035 1040 

Glu Val Asp Asn Asn Thr Asp Glu l.eu Lys Phe ser Asn Cys Glu Lys 
1045 1050 1055 

Glu Gin Val Tyr Pro Gly Asn Thr Val Ala Cys Asn Asp Tyr Asn Lys 
1060 1065 1070 

Asn His Gly Ala Asn Ala Cys Ser Ser Arg Asn Arg Gly Tyr Asp Glu 
1075 1080 1085 

Ser Tyr Glu Ser Asn ser Ser lie Pro Ala Asp Tyr Ala Pro Val Tyr 
1090 1095 1100 

Glu Glu Glu Ala Tyr Thr Asp Gly Gin Arg Gly Asn Pro Cys Glu Phe 
1105 1110 1115 1120 

Asn Arg Gly His Thr Pro z*eu Pro Ala Gly Tyr Val Thr Ala Glu lieu 
1125 1130 1135 

Glu Tyr Phe Pro Glu Thr Asp Thr Val Trp Val Glu lie Gly Glu Thr 
1140 1145 1150 

Glu Gly Thr Phe lie val Asp ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 1165 



<2) INFORMATION FOR SEQ ID NOs5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECX7LE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATGATTCATG CGGCAGATA 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
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(Xi) SEQUENCE DESCRIPTIONS SEQ ID NOt6: 
GAGTCGGAA6 CAGATCTTAA TAATGCACAA TTAAG6 



(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOiX)GY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATACYCGATC 6ATATGATAR TCCGT 



(2) INFORMATION FOR SEQ ID NO; 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CCCAAT6T6A AT6TACTTT6 CGC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ser Thr Gly Arg Leu Pro Leu Asp 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGTACWG6MA GRTTACCRTT RGAY 
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(2) ZNFORHlWTIOZr FOR SEQ ZD NO: 11: 

(i) SBOtnSNCE CBAimCTERZSTZCS: 

(A) IiEN GTHt 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOliOGY: linear 

(ii) MOLECUI^ TTPE: peptide 

(xi) SEQUENCE DESCRZPTZON: SEQ ZD NO: 11: 

Glu Asp ser Pro Val Ser Ala Asn 

5 



(2) ZNFOXtMATZON FOR SEQ ZD NO: 12: 

(i) SEQUENCE CHARACTERZSTZCS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQXmNCE DESCRZPTZON: SEQ ZD NO: 12: 
GARGATTCWC CAGTWTCWGC WAAT 



(2) INFORMATION FOR SEQ ZD NO: 13: 

(i) SEQUENCE CHARACTERZSTZCS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asn Gly Phe Asn Arg Ala Glu Phe Gly Val 

5 10 



(2) INFORMATION FOR SEQ ZD NO:14i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AATGGWTTTA ATAGTGCTGA ATTTGGGAGT W 



(2) INFORMATION FOR SEQ ZD NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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(C) STKAMDEDKESS: single 

(D) TOPOliOGY: linear 

<ii) MOLECUI«E TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Val Thr Ala Glu Thr Val Arg Ser Gin Thr 
5 10 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) liENGTH: 30 bases 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOI.ECUI.E TYPE: DNA (synthetic) 
(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTAACWGCAG ARACWGTHAG WAGTCAAACW 30 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) UCNGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Val Pbe Asn Pro Gly Gly Ala lie Trp lie Ala Asp Glu 
5 10 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTKTTYAATC CWGGWGGMGC MATWTGGATW GCWGATGARG AT 42 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCXUPTION: SEQ ZD NOsl9 

Val Arg Gly Gly Phe Gly 
5 



(2) IMPORMATIOH FOR SEQ ID NO:20: 

<i) SEQUENCE CHJkRACTERZSTZCS : 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOI.OGY: linear 

(11) MOLECULE TYPES DNA (synthetic) 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

GTMMGAGGWG GWTTTGGR 



(2) INFORMATION FOR SEQ ID NO 1 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gly Thr Asn His Thr Arg Thr 
5 



(2) INFORMATION FOR SEQ ID NO: 22: 

(1) SEQ17ENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (synthetic) 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGKACRAAYC AYACMMGAAC W 



(2) INFORMATION FOR SEQ ID NO: 23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Val Arg Trp Pro Gly Glu He 

5 
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(2) IHFORMATIOM FOR SEQ ZD NOs24: 

(1) SEQUEKCE CHARACTERISTICS: 
(A> I.ENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRAMDEDKESS : single 
<D) TOPOLOGY: linear 

(ii) MOI.ECX7I.E TYPE: DMA (synthetic) 

(Xi) SEQX7EMCE DESCRIPTIOK: SEO ID NO:24: 

GTHiraATGGC CWGGWGARAT W 21 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Ser Asp Ser Trp Arg Ala 
5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AGTGATTCWT GGAGAGCW 18 

(2) INFORMATION FOR SEQ ID NO x 27s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met xaa Asn Asn lie Gin Asn Gin Cys val Pro Tyr Asn Cys Leu Xaa 

1 5 ' 10 15 

Asn Pro Glu Val Glu lie Leu Xaa Glu Glu Arg Ser Thr Gly Arg Leu 
20 25 30 
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Fro Leu Asp lie Ser Z.eu Ser X.eu Thr Arg Phe z.eu Leu S r Glu Phe 
35 40 45 

val Pro Gly Val Gly Val Ala Phe Gly Leu Phe Asp Leu He Trp Gly 
50 55 60 

Phe He Thr Pro ser xaa Trp Ser Leu Phe Leu Leu Gin He Glu Gin 
€5 70 75 80 

Leu He Glu Gin Arg He Glu Thr Leu Glu Arg Asn Arg Ala He Thr 
85 90 95 

Thr Leu Arg Gly Leu Ala Asp Ser Tyr Glu Xaa Tyr Xaa Glu Ala Leu 

100 105 110 

Arg Glu Trp Glu Xaa Asn Pro Asn Asn Ala Gin Leu Arg Glu Asp Val 
115 120 125 

Arg He Arg Phe Ala Asn Thr Asp Asp Ala Leu He Thr Ala He Asn 

130 135 140 

Asn Phe Thr Leu Thr ser Phe Glu He Pro Leu Leu Ser Val Tyr Val 
145 150 155 160 

Gin Ala Ala Asn Leu Bis Leu Ser Leu Leu Arg Asp Ala Val Ser Phe 

165 170 175 

Gly Gin Gly Trp Gly Leu Asp He Ala Thr Val Asn Asn His Tyr Asn 
180 185 190 

Arg Leu He Asn Leu He His Arg Tyr Thr xaa Bis Cys Leu Asn Thr 

195 200 205 

Tyr Asn Gin Gly Leu Glu Asn Leu Arg Gly Thr Asn Thr Arg Gin Tro 
210 215 220 

Xaa Arg Phe Asn Gin Phe Arg Arg Xaa Leu Thr Leu Thr Val Leu asd 
225 230 235 240 

He Val Ala Leu Phe Pro Asn Tyr Asp Xaa Arg xaa Tyr Pro He Gin 
245 250 255 

Thr ser ser Gin Leu Thr Arg Glu He Tyr Thr Ser Ser val He Glu 
260 265 270 

Asp ser Pro Val Ser Ala Asn He Pro Asn Gly phe Asn Arg Ala Glu 
275 280 285 

Phe Gly Val Arg Pro Pro His Leu Met Asp Phe Met Asn Ser Leu Phe 

290 295 300 

Val Thr Ala Glu Thr Val Arg ser Gin Thr Val Trp Gly Gly His Leu 
305 310 315 320 

val ser ser Arg Asn Thr Ala Gly Asn Xaa He Asn Phe Pro Xaa Tvr 

325 330 335 

Gly val Phe Asn Pro Gly Gly Ala He Trp He Ala Asp Glu Asp Pro 

Arg Pro Phe Tyr Arg Thr Leu Ser Asp Pro Val Phe Val Arg Glv Glv 

355 360 365 ^ ^ 

Phe Gly xaa Pro His Tyr Val Leu Gly Leu Arg Gly val Xaa Phe Gin 
370 375 380 
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Gin Thr Gly Thr Asn His Thr Arg Thr Phe Arg Asn Ser Gly Thr He 
385 390 395 400 

Asp Ser l«eu Asp Glu lie Pro Pro Gin Asp Asn ser Gly Ala Pro Trp 
405 410 415 

Asn Asp Tyr Ser Bis Val I<eu Asn His Val Thr Phe Val Arg Trp Pro 
420 425 430 

Gly Glu He Xaa Gly Ser Asp Ser Trp Arg Ala Pro Met Phe Ser Trp 
435 440 445 

Thr Bis Arg Ser Ala Xaa Xaa Thr Asn Xaa lie Xaa Pro Xaa Xaa He 
450 455 460 

Thr Gin He Pro Xaa Val Xaa Ala Bis Xaa Leu xaa Ser Gly Xaa Thr 
465 470 475 480 

Val Val Arg Gly Pro Gly Phe Thr Gly Gly Asp Xaa Z.eu Arg Arg Thr 
485 490 495 

Xaa Xaa Gly Xaa Phe Ala Xaa Xaa Xaa Val Asn He Xaa Gly Xaa Leu 
500 505 510 

Xaa Gin Arg Tyr Arg Xaa Arg He Arg Tyr Ala ser Thr Thr Xaa Leu 

515 520 525 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa xaa Xaa Xaa xaa Gly Xaa Phe 
530 535 540 

Xaa Xaa Thr Mel: Xaa Xaa Gly Asp Xaa Leu xaa Xaa Xaa Xaa Phe Xaa 

545 550 555 560 

Xaa Ala Xaa Xaa Xaa Thr Xaa Phe Xaa Phe Xaa Xaa Xaa Gin ser Xaa 
565 570 575 

Phe Thr Xaa Gly Xaa Xaa Xaa Phe Xaa Ser xaa xaa Glu Val Tyr He 

580 585 590 

Asp Xaa Xaa Glu Xaa Xaa Pro Xaa xaa xaa Thr Phe Glu Ala Glu Xaa 
595 600 605 

Asp Xaa Glu Arg Ala Gin Xaa Ala Val Asn Ala Leu Phe Thr Ser Xaa 

610 615 620 

Xaa Gin Xaa Gly Xaa Xaa Thr Xaa Val Thr Xaa Tyr Bis He Asp Gin 
625 630 635 640 

Val Ser Asn z.eu Val Xaa cys Leu Ser Asp Glu Phe Cys Leu Asp Glu 
645 650 655 

Xaa Arg Glu Leu ser Glu Xaa Val Bis Xaa Ala Xaa Arg Leu Ser Asp 
660 665 670 

Xaa Arg Asn Leu Leu Gin Asp Pro Asn Phe Xaa Gly He Asn Arg Gin 
675 680 685 

Xaa Asp Xaa Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Xaa Gly 
690 695 700 

Asp Asp Val Phe Xaa Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp 

705 710 • 715 720 

Glu cys Tyr Pr Thr Tyr Leu Tyr Gin xaa He Asp Glu ser Xaa Leu 
725 730 735 
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Xaa Xaa Tyr Thr Arg Tyr Gin Z.6U Arg Gly Tyr lie Glu Asp ser Gin 
740 745 750 

Aap lieu Glu lie Tyr Z.eu lie Arg Tyr Asn Xaa Xaa Hia Glu Pro Val 
755 760 765 

Asn Val Xaa Gly Thr Gly ser Leu Trp Pro Leu Ser Val xaa Xaa Xaa 
770 775 780 

lie Xaa Xaa Cys Gly Glu Pro Asn Arg cys Ala Pro His Leu Glu Tro 
785 790 795 BOO 

Asn Pro Asp Leu Xaa Cys Ser Cys Arg Asp Gly Glu Xaa Cys xaa Bis 

805 810 815 

His Ser His His Pho Ser Leu Asp He Asp Val Gly cys Thr Asp Leu 
820 825 830 

Asn Glu Asp Leu Xaa Val Trp xaa He Phe Xaa He Xaa Thr Gin Asp 
835 840 845 

Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu xaa Pro Leu 
850 855 860 

Xaa Gly Glu Ala Leu Xaa Arg Val xaa Arg Ala Glu Xaa Xaa Trp Arg 
865 870 875 880 

Asp xaa Arg Glu Xaa Leu Xaa Leu Glu Thr Asn He Val Tyr Xaa Glu 
885 890 895 

Ala Xaa Glu Ser Val Asp Ala Leu Phe Val Asn ser Gin Tyr Asp Xaa 

900 905 910 

Leu Gin Ala Asp Thr Asn He Ala Met He His Ala Ala Asp Xaa Ara 
915 920 925 

Val His Arg He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro 
930 935 940 

Gly Val Asn Xaa Xaa He Phe Glu Glu Leu Xaa Gly Arg He Phe Thr 
945 950 955 960 

Ala xaa Xaa Leu Tyr Asp Ala Arg Asn Val He xaa Asn Gly Xaa Phe 

965 970 975 

Asn Asn Gly Leu Xaa cys Trp Asn Val Xaa Gly His Val Asp Val Glu 
980 985 990 

Glu Gin Asn Asn His Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala 

995 1000 1005 

Glu val ser Gin Glu Val Arg val Cys Pro Gly Arg Gly Tyr He Leu 
1010 1015 1020 

Arg Val Thr Ala Tyr Xaa Glu Gly Tyr Gly ^lu Gly cys Val Thr He 
1025 1030 1035 1040 

His Glu xaa Xaa Asn Asn Thr Asp Glu Leu Xaa Phe Ser Asn Cys Xaa 
1045 1050 1055 

Xaa Glu xaa Val Tyr Pro Xaa Asn Thr val xaa Cys Asn Asp Tvr Xaa 
1060 1065 1070 

xaa Asn Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Xaa ser Arg Asn Aro Glv 
1075 1080 1085 
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Tyr Asp Glu Xaa Tyr Xaa Ser Asn Ser Ser Xaa Pr Ala Asp Tyr Ala 
1090 1095 1100 

Xaa Val Tyr Glu Glu Xaa Xaa Tyr Thr Asp Gly xaa Arg xaa Asn Pro 
1105 1110 1115 1120 

cys Glu xaa Asn Arg Gly xaa Xaa xaa Xaa Thr Pro Leu Pro Ala Gly 

1125 1130 1135 

Tyr Val Thr Xaa Glu lieu Glu Tyr Phe Pro Glu Thr Asp Xaa Val Trp 
1140 1145 1150 

Xaa Glu He Gly Glu Thr Glu Gly Thr Phe He Val Asp ser Val Glu 
1155 1160 1165 

Leu Leu Leu Met Glu Glu 
1170 
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1 1. A process for controlling lepidopteran pests which comprises contacting said pests with 

2 a lepidopteran-controlling efifective amount of Bacillus thuringiensis PS91C2, or spores, crystals, 

3 or toxins from said isolate, or mutants thereof which retain activity against lepidopteran pests. 

1 2. The process, according to claim 1, wherein a substantially intact Bacillus thurinffensis 

2 isolate, or mutant thereof which retains activity against lepidopteran pests, is treated to prolong 

3 the pestiddal activity when the substantially intaa cell is applied to the environment of a target 

4 pest 

1 3. A composition of matter compnsingBacillus thurin^ensis PS91C2, or a mutant thereol^ 

2 or spores or crystals of said isolate, in association with an insecticide carrier. 

1 4. A substantially pure toxin protein wherein said toxin has activity against a lepidopteran 

2 pest and has at least one characteristic selected from the group consisting of: 

3 (a) the amino add sequence of said toxin conforms to the Generic Formula shown 

4 in SEQ ID NO. 27; 

5 (b) the amino add sequence of said toxin is at least 75% homologous with toxin 

6 91C% 

7 (c) the DNA which encodes said toxin hybridizes with DNA which encodes all or 
g part of protein 91C2; 

9 (d) the DNA which encodes said toxin hybridizes with a probe selected from the 

10 group consisting of SEQ ID NO. 10, SEQ ID NO. 12, SEQ ID NO. 14, SEQ ID 

11 NO, 16, SEQ ID NO. 18, SEQ ID NO. 20, SEQ ID NO. 22, SEQ ID NO. 24, 

12 and SEQ ID NO. 26, and DNA encoding SEQ ID NO. 9, SEQ ID NO. 11, SEQ 

13 ID NO. 13, SEQ ID NO. 15, SEQ ID NO. 17, SEQ ID NO. 19, SEQ ID NO. 21, 

14 SEQ ID NO. 23, and SEQ ID NO. 25; 

15 (e) said toxin is immunoreactive with an antibody which immunoreacts with toadn 

16 91C2; and 

17 (f) the amino add sequence of said toxin has an alignment value of at least about 

18 450 with toxin 91C2. 



1 
2 



5. The toxin, according to daim 4, wherein said toxin conforms to said Generic Formula 
shown in SEQ ID NO. 27. 
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6. The team, aocoxding to claim 4, wherein the amino add sequence of said toxin has an 
alignment value of at least about 450 with toxin 91C2. 

7. The tojdn, according to claim 4, wherein the DNA encoding said toxin hybridizes with 
DNA which encodes all or part of toxin 91C2. 

8- The toxin, according to claim 4, wherein the DNA which encodes said toxin hybridizes 
with a probe selected from the group consisting of SEQ ID NO. 10, SEQ ID NO. 12, SEQ ID 
NO, 14, SEQ ID NO. 16. SEQ ID NO. 18, SEQ ID NO. 20. SEQ ID NO, 22. SEQ ID NO. 24, 
and SEQ ID NO. 26. and DNA coding for SEQ ID NO. 9. SEQ ID NO. 11, SEQ ID NO. 13, 
SEQ ID NO. 15. SEQ ID NO. 17, SEQ ID NO. 19, SEQ ID NO. 21. SEQ ID NO. 23. and SEQ 
ID NO. 25. 

9. The toxin, according to claim 4, wherein said toxin is immunoreactive with an antibody 
which immunoreacts with protein 91C2. 

10. The toxin, according to claim 4, wherein said toxin has the amino add sequence 
consisting essentially of the sequence shown in SEQ ID NO. 4. 

11. An isolated polynucleotide encoding a Bacilbts thuringiensis toxin as defined in 

daim 4. 

12.. An isolated polynucleotide, according to claim 11, wherein said polynudeotide 
comprises DNA which encodes an amino add sequence shown in SEQ ID NO. 4. 

13. The polynudeotide, according to claim 11. \^erein said polynucleotide comprises a 
nudeotide sequence consisting essentiaUy of the sequence shown iii SEQ ID NO. 3. 

14. A method for controlling lepidopteran pests, wherein said method comprises 
contacting said pests with a lepidopteran-controUing effective amount of a toxin as defined in 
claim 1. 

15. A host transformed by a nucleotide sequence encoding a toxin as described in claim 4. 



16. The transformed host, according to daim 15, wherein said host is transformed by a 
nucleotide sequence encoding a toxin protein having the amino add sequence of SEQ ID NO. 4. 



wo 96/05314 



PCT/US95/10310 



49 

4 17. The transformed host, according to daim 15» wherein said host is transformed to 

5 eacpress the nucleotide sequence of SEQ ID NO. 3. 

1 18. A toxin encoded by a nucleotide sequence obtainable from Bacillus thuringmsis 

2 PS91C2, and variants thereof, wherein said toxin is active against iepidopteran pests. 
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MjNNIQNQCV PYNCLxNPEV EILxEERSTG KLPLDISLSL TRFLLSEFVP 
GVGVAFGLFD LIWGFlTPSx WSLFLLQIEQ LIEQRIETLE RNRAITTLR6 
LADSYEbYbE ALREWE-NPN NAQLREDVRI RFANTDDALI TAINNFTLTS 
FEIPLLSVYV QAANLHLSLL RDAVSFGQGW GLDIATVNNH YKRLINLIHR 

201 

YTjHCLDTYN QGLENLRGTN TRQW-RFNQF RRxLTLTVLD IVALFPNYDo 
RuYPIQTSSQ LTREIYTSSV lEDSPVSANI PNGFNRAEFG VRPPHLMDFM 
NSLPVTAETV RSQTVWGGHL VSSRNTAGN- INFP.YGVFN PGGAIWIADE 
DPRPFYRTLS DPVFVRGGFG zPHYVLGLRG V-FQQTGTNH TRTFRNSGTI 

401 

DSLDEIPPQD NSGAPWNDYS HVLNHVTFVR WPGEI-GSDS WRAPMFSWTH 
RSA — TNoIz Px-ITQIPoV KAH-L-SG-T WRGPGFTGG DbLRRTz-Go 

FA-o-VNI-G -L-QRYRORI RYASTTzLjb -o-b-G-SCb GxFxkTMx- 

GD-L — xzFx -A-bzToF-P QS-PTbG uxuF.SzxEV YIDkbEbbPo 

601 

-OTFEAE-DK ERAQKAVNAL FTSozQbGbK TzVTzYHIDQ VSNLV-CLSD 
EPCLDEKREL SEKVHKAKRL SDxRNLLQDP NFkGlNRQoD -GWRGSTDIT 
IQxGDDVFKE NYVTLPGTFD ECYPTYLYQK IDESKLK-YT RYQLRGYIED 
SQDLEIYLIR YN-KHEpVNV oGTGSLWPLS Vjz-Ix-CGE PNRCAPHLEW 

801 

NPDL-CSCRD GEKCoHHSHH FSLDIDVGCT DLNEDLzVWb IFKIKTQDGH 
ARLGNLEFLE EjPLbGEAL- RVKRAEKKWR DKREKLjLET NIVYKEAKES 
VDALFVNSQY DjLQADTNIA MIHAADKRVH RIREAYLPEL SVIPGVNo-I 
FEEL j GRIFT A — LYDARNV IKNGzFNNCT. -CWNVKSIVD VEEQNNHRSV 

1001 

LWPEWEAEV SQEVRVCPGR GYILRVTAYK EGYGEGCVTI HEbxNNTDEL 
KFSNC-xEjV YPzNTVuCND Y— N-X. .-Z A— SRNRGYD E-YxSNSSbP 
ADYA-VYEEj -YTDGjRzNP CE-NRG. . .- TPLPAGYVT- ELEYPPETD- 
VWbEIGETEG TPIVDSVELL LMEE 



Figure 1 



